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Preface 


This textbook collects the notes for an introductory course in measure 
theory and integration taught by the authors to undergraduate students of 
Scuola Normale Superiore in the last 10 years. 

The goal of the course was to present, in a quick but rigorous way, 
the modern point of view on measure theory and integration, putting Le- 
besgues theory in M" into a more general context and presenting the ba¬ 
sic applications to Fourier series, calculus and real analysis. The text can 
also pave the way to more advanced courses in probability, stochastic 
processes or geometric measure theory. 

Prerequisites for the book are a basic knowledge of calculus in one and 
several variables, metric spaces and linear algebra. 

All results presented here, as well as their proofs, are classical. We 
claim some originality only in the presentation and in the choice of the 
exercises. Detailed solutions to the exercises are provided in the final part 
of the book. 

Pisa, July 2011 

Luigi Ambrosio, Giuseppe Da Prato 
and Andrea Mennucci 



Introduction 


This course consists of an introduction to the modern theories of measure 
and of integration. Historically, this has been motivated by the necessity 
to go beyond the classical theory of Riemann’s integration, usually taught 
in elementary Calculus courses on the real line. It is therefore useful to 
describe the reasons that motivate this extension. 

(1) It is not possible to give a simple, handy, characterization of the class 
of Riemann 's integrable function, within Riemann’s theory. This is indeed 
possible within the stronger theory, due essentially to Lebesgue, that we 
are going to introduce. 

(2) The extensions of Riemann’s theory to multiple integrals are very 
cumbersome. This extension, useful to compute areas, volumes, etc., is 
known as Peano-Jordan theory, and it is sometimes taught in elementary 
courses of integration in more than one variable. In addition to that, im¬ 
portant heuristic principles like Cavalieri’s one can be proved only under 
technical and basically unnecessary regularity assumptions on the do¬ 
mains of integration. 

(3) Many constructive processes typical of Analysis (limits, series, in¬ 
tegrals depending on a parameter, etc.) cannot be handled well within 
Riemann’s theory of integration. For instance, the following statement is 
true (it is a particular case of the so-called dominated convergence the¬ 
orem)’. 

Theorem 1. Let f h : [— 1, 1] —> M be continuous functions pointwise 
converging to a continuous function /. Assume the existence of a con¬ 
stant M satisfying \fh(x)\ < M for all x e [—1, 1] and all h e N. Then 



J fix) dx 


Even though this statement makes perfectly sense within Riemann’s the¬ 
ory, any attempt to prove this result within the theory (try, if you don’t 
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believe!) seems to fail, and leads (more or less explicitely, see [2]) to 
a larger theory. In addition to that, the continuity assumption on the 
limit function / is not natural, because a pointwise limit of continuous 
functions need not be continuous, and we would like to give a sense to 
/_!i f( x )dx even without this assumption. This necessity emerges for 
instance in the study of the convergence of Fourier series 

OO 

/ 0) = E cij cos(/7rx) + bi sin(/7rx) r e [-1,1]. 

/=o 

In this case the uniform convergence of the series, which implies the 
continuity of / as well, is ensured by the condition JT |a, | + | b, | < 
oo. On the other hand, we will see that the “natural” condition for the 
convergence (in a suitable sense) of the series is much weaker: 

OO 

J2 a d + b f < 00 • 

i=0 


Under this condition the limit function / need not be continuous: for 
instance, if /(x) = 1 for x e [—1/2, 1/2] and /(x) = 0 otherwise, then 
we will see that the coefficients of the Fourier series are given by b, = 0 
for all i (because / is even) and by 


cii 


1 

2 


if i = 0; 


sin(7r//2) 

in 


if i > 0. 


(4) The spaces of integrable functions, as for instance 

H := j/ : [—1, 1] —> M : J f 2 (x)dx < oo 
endowed with the scalar product 

(f’g) ■= J f( x )g( x )dx 

and with the (pseudo) induced distance d(f , g) — (f — g, f — g) l/2 , 
are not complete , if we restrict ourselves to Riemann integrable functions 
only. In this sense, the path from Riemann’s to Lebesgue’s theory is the 
same one that led from the (incomplete) set of rational numbers Q to the 
(complete) real line R. 
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Lebesgue’s theory extends Riemann’s theory in two independent direc¬ 
tions. The first one is concerned, as we already said, with more general 
classes of functions, not necessarily continuous or piecewise continuous 
(the so-called Borel or measurable functions). The second direction can 
be better understood if we remind the very definition of Riemann’s integ¬ 
ral 



i=l 


where t\ — — 1, t n — 1 and the approximation is better and better as the 
parameter sup /<n t i+ \ — /,■ tends to 0. More generally, instead of integrat¬ 
ing with respect to the “length” measure, we can integrate with respect to 
a generic measure fi and define 


n 



(i) 


where Ai, ..., A n is a partition of [—1, 11, x, e A, and the approximation 
is expected to be better and better as the parameter sup, diarnfA, ) tends to 
0. We may think, for instance to [—1, 1] as a possibly non-homogeneous 
bar, and to fi(A) as the “mass” of the subset A of the bar: because of 
non-homogeneity, fi(A) need not be proportional to the length of A. 

Once we adopt this viewpoint, we will see that it is not hard to obtain a 
theory of integration in general metric spaces, and even in more general 
classes of spaces. On the other hand, the approximation (1), that in any 
case clarifies the intuitive meaning of the integral, will remain valid for 
continuous functions only. 


Chapter 1 
Measure spaces 


In this chapter we shall introduce all basic concepts of measure theory, 
adopting the point of view of measures as set functions. The domains 
of measures may have different stability properties, and this leads to the 
concepts of ring, algebra and o -algebra. The most basic tool developed 
in the chapter is Caratheodory’s theorem, which ensures in many cases 
the existence and the uniqueness of a a -additive measure having some 
prescribed values on a set of generators of the a-algebra. In the final 
part of the chapter we will apply these abstract tools to the problem of 
constructing a “length” measure on the real line, the so-called Lebesgue 
measure , and we will study its main properties. 

1.1. Notation and preliminaries 

We denote by N = {0, 1, 2, ...} the set of natural numbers, and by N* the 
set of positive natural numbers. Unless otherwise stated, sequences will 
always be indexed by natural numbers. 

We shall denote by X a non-empty set, by &{X) the set of all parts of 
X and by 0 the empty set. For any subset ,4 of X we shall denote by A c 
its complement A c := [x e X : x £ A}. If A, B e £P(X) we denote 
by A \ B the relative complement A D B c , and by AAB the symmetric 
difference (A \ B) U (B \ A). 

Let (A„) be a sequence in ,-X 1 (X). Then the following De Morgan 
identity holds, 



Moreover, we define (l 1 

OO OO OO OO 

lim sup A„ := P| (J A m , hm inf A„ := (J P| A m . 

n ^°° n =0 m=n n =0 m=n 


Notice the analogy with liminf and limsup limits for a sequence ( a n ) of real numbers. We have 
lim sup a n = inf sup a m and lim inf a n = sup inf a m . This is something more than an analogy, 
n —>• oo neNm>n n-+o o neN m — n 

see Exercise 1.1. 
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As it can be easily checked, lim sup (l A n (respectively lim inf,, A n ) con¬ 
sists of those elements of A that belong to infinitely many A n (respect¬ 
ively that belong to all but finitely many A,,). 

It easy to check that if (A„) is nondecreasing (i.e. A n C A, i+] , n e N), 
we have 

OO 

lim inf A n = lim sup A n — I I A n , 

n— MX) „ 

n ^°° n=0 

whereas if (A„) is nonincreasing (i.e. A n D A n+ j, n e N), we have 

OO 

lim inf A n = lim sup A n = f| A n . 

n —>■ oo „ I I 

n =0 

In the first case we shall write A„ \ L, and in the second one A n \. L. 

1.2. Rings, algebras and a-algebras 

Definition 1.1 (Rings and Algebras). A non empty subset si of & ( X ) 
is said to be a ring if: 

(i) 0 belongs to si ; 

(ii) A, 5 e s/ =>• A U 5, ADB e s/- 

(iii) A, B e s/ => A \ B e s/. 

We say that a ring is an algebra if X & si. 

Notice that rings are stable only with respect to relative complement, 
whereas algebras are stable under complement in A. 

Let Jd c ^ (A). As the intersection of any family of algebras is still 
an algebra, the minimal algebra including ,'XA (that is the intersection of all 
algebras including J(f) is well defined, and called the algebra generated 
by J(f. A constructive characterization of the algebra generated by ,'XA can 
be easily achieved as follows: set = -Xf U {0} and 

,X (i+V) := |J {A U 5, A c : A, B e ,X (i) } ii > 0. 

Then, the algebra si generated by ,'Xf is given by (J ; Indeed, it is 
immediate to check by induction on i that si D and therefore the 
union of the ■X (,) A is contained in si. On the other hand, this union is 
easily seen to be an algebra, so the minimality of si provides the opposite 
inclusion. 

Definition 1.2 (<r -algebras). A non-empty subset A of' AX (A) is said to 
be a a -algebra if: 


(i) S’ is an algebra; 
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(ii) if (A„) is a sequence of elements of 8 then [J A„ € 8. 

n =0 

If 8 is a cr-algebra and (A„) c 8 we have P) n A n e 8 by the De 
Morgan identity. Moreover, both sets 

lim inf A „, lim sup A n , 

n >00 „—v nn 


belong to 8. 

Obviously, {0, X} and 8? {X) are o'-algebras, respectively the smal¬ 
lest and the largest ones. Let 3X8 be a subset of 8X (X). As the intersection 
of any family of a-algebras is still a o--algebra, the minimal o--algebra 
including 8X8 (that is the intersection of all o-algebras including 8X8) is 
well defined, and called the o-algebra generated by 3X8. It is denoted by 
a {3X8). 

In contrast with the case of generated algebras, it is quite hard to give 
a constructive characterization of the generated a -algebras: this requires 
the transfinite induction and it is illustrated in Exercise 1.18. 

Definition 1.3 (Borel a-algebra). If (E. d) is a metric space, the o- 
algebra generated by all open subsets of E is called the Borel cr-algebra 
of E and it is denoted by 38 (E). 

In the case when E = JR the Borel o -algebra has a particularly simple 
class of generators. 

Example 1.4 {38 (M)). Let -X be the set of all semi-closed intervals \a.b) 
with a < b. Then o(-X) coincides with 38 (W). In fact o (,X) contains all 
open intervals (a, b) since 


■ I f 1 \ 1 

(a, b) = I I n H—, b ) with no > --. 

w L n / b — a 

n=n 0 

Moreover, any open set A in M is a countable union of open intervals . <2) 
An analogous argument proves that 38 (M) is generated by semi-closed 
intervals {a, b], by open intervals, by closed intervals and even by open 
or closed half-lines. 


Indeed, let (aj) be a sequence including all rational numbers of A and denote by 4 the largest 
open interval contained in A and containing a ^ . We clearly have A D (JAo h ■ but also the opposite 
inclusion holds: it suffices to consider, for any x e A, r >0 such that (x — r, x r) C A, and k 
such that ak e (a — r, x + r) to obtain (jc — r, x + r) C Ik, by the maximality of Ik, and then x € Ik- 
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1.3. Additive and a -additive functions 

Let stf c & ( X ) be a ring and let /x be a mapping from .&/ into [0, +oo] 
such that /x(0) = 0. We say that /x is additive if 




If /x is additive, A, B e srf and A D B, we have /x(A) = fi(B) + /x(A \ 
B), so that /x(A) > /i(B). Therefore any additive function is nondecreas¬ 
ing with respect to set inclusion. Moreover, by applying repeatedly the 
additivity property, additive measures satisfy 



for n e N* and mutually disjoint sets A \, ..., A n e .&/. 

A set function /x on .e/ is called o-additive if /x(0) = 0 and for any 
sequence (A„) C ^ of mutually disjoint sets such that [J n A n € srf we 
have 



Obviously cr-additive functions are additive, because we can consider 
countable unions in which only finitely many A„ are nonempty. 

Another useful concept is the a -subadditivity, we say that /x is a- 
subadditive if 


00 


d(B) < ^2n(A n ), 


for any B e srf and any sequence (A„) C srf such that B C A„. 
Notice that, unlike the dehnition of a -additivity, the sets A„ need not be 
disjoint here. 

Remark 1.5 (a -additivity and a -subadditivity). Let /x be additive on 
a ring s# and let (A„) C .&/ be mutually disjoint and such that (J n A„ e 
stf. Then by monotonicity we have 

A (Q A„ J > /x | (J A„ j = ^/x(A„), for all k e N. 

\/ 7=0 J \«=0 / / 2=0 

Therefore, letting ffoowe get 
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Thus, to show that an additive function is cr-additive, it is enough to 
prove that it is cr-subadditive. 

Conversely, it is not difficult to show that a-additive set functions are 
a -subadditive: indeed, if B c U „A n we can define A ( j = B fl A f) and 
A! n := B fl A n \ U„,<„ A m for n £ N*, so that B is the disjoint union of 
the sets A' n , to obtain 

OO 00 

li(B) = /x(A„). 

n=0 n=0 

Let fi be additive on g/. Then a -additivity of fi is equivalent to con¬ 
tinuity of fi in the sense of the following proposition. 

Proposition 1.6 (Continuity on nondecreasing sequences). If fi is ad¬ 
ditive on a ring stf, then (/) (//), where: 

(i) fi is a -additive; 

(ii) (A n ) c sd and A £ g/, A n \ A =>■ /i(A„) f fi(A). 

Proof. (i)=Kii). In the proof of this implication we can assume with no 
loss of generality that fi(A n ) < oo for all n £ N. Let (A n ) c g/, A £ g/, 
A n \ A. Then 

OO 

A = Ao U |^J(A„ + i \ A„), 

n =0 

the unions being disjoint. Since fi is cr-additive, we deduce that 

OO 

fi(A) = fi(A 0 ) + VV/x(A„ +1 ) - fi(A n )) = lira fi(A n ), 


and (ii) follows. 

(ii)=>(i). Let (A„) c g^ be mutually disjoint and such that A := 

U„ A » 6 Set 

m 

B„, := [J Ak. 

k=0 

m 

Then B m f A and fi(B m ) = fi(Af) f fi(A) by the assumption. This 

o 

implies (i). □ 

Proposition 1.7 (Continuity on nonincreasing sequences). Let fi be 

cr-additive on a ring .&/. Then 

(A n ) C gtf and A e g/, A n \ A, fi(A 0 ) < oo => fi(A n ) \ fi(A). 

( 1 . 1 ) 
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Proof. Setting B„ := A 0 \A„, B A 0 \ A, we have B n \ B, therefore the 
previous proposition gives t /z(5). As /x(A„) = /x(A 0 ) — /x(5„) 

and /x(A) = /x(A 0 ) — /z(5) the proof is achieved. □ 

Corollary 1.8 (Upper and lower semicontinuity of the measure). Let 

fi be a-additive on a a-algebra S and let (A„) C S. Then we have 

li (liminf A„ ) < liminf fi(A„) (1.2) 

V «->oo / n—>oo 

and, if fi(X) < oo, we have also 

lim sup /u (A„) < li limsupA,,). (1.3) 

>oo \ n—> oo / 

Proof. Set L := lim sup (l A„. Then we can write 

OO OO 

L = P| B n where B n := [J A m . (1.4) 

n =0 m=n 

Now, assuming /x(A) < oo, by Proposition 1.7 it follows that 
H(L) = lim ia(B„) = inf n(B n ) > inf sup/x(A m ) = lim sup /i(A n ). 

n ~m>n n—>o o 

Thus, we have proved (1.3). The inequality (1.2) can be proved similarly 
using Proposition 1.6, thus without using the assumption ii(X) < oo. □ 
The following result is very useful to estimate the measure of a lim sup 
of sets. 

Lemma 1.9. Let /i be a-additive on a o-algebra S and let (A„) c S. 

OO 

Assume that fi(A n ) < oo. Then 

n =0 




lim sup A„ 


= 0 . 


Proof. Set L — lim sup A„ and define B„ as in (1.4). Then the inclusion 

n —>oo 

L c B n gives 


OO 

/x(L) < /x(5„) < ^ /x(A m ) for all n e N. 

m=n 


As n 


oo we find /x(L) = 0. 


□ 
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1.4. Measurable spaces and measure spaces 

Let S be a a -algebra of subsets of A. Then we say that the pair (X, S) 
is a measurable space. Let /x : $ —> [0, +oo] be a cr-additive function. 
Then we call /z a measure on ( X, S’), and we call the triple (X, S’, /x) a 
measure space. 

The measure /z is said to be finite if /x (A) < oo, a-finite if there exists 
a sequence (A„) C S’ such that [J n A n = X and /z(A„) < oo for all 
n e N. Finally, /z is called a probability measure if /z(A) = 1. 

The simplest (but fundamental) example of a probability measure is 
the Dirac mass S x , defined by 


8 X (B) := 


if x e B 
if x £ B. 


This example can be generalized as follows, see also Exercise 1.5 and 
Exercise 1.23. 

Example 1.10 (Discrete measures). Assume that Y c A is a finite or 
countable set. Given c : Y —»■ [0, +oo] we can define a measure on 
(A, ^(A)) as follows: 

p-(B) := C <A) wb c x - 

xeBCY 

Clearly fi = C A)S X is a finite measure if and only if X\ s y c ( x ) < 
oo, and it is er-finite if and only if c(x) e [0, +oo) for all x € Y. 

More generally, the construction above works even when Y is uncount¬ 
able, by replacing the sum with 

sup E c(x), 

ceBHY' 

where the supremum is made among the finite subsets Y’ of Y. The meas¬ 
ures arising in the previous example are called atomic, and clearly if A is 
either finite or countable then any measure /z in (A, fA (A)) is atomic: it 
suffices to notice that 

l 1 = E c(x)S x with c(x) := /z({x}). 

xeX 

In the next section we will introduce a fundamental tool for the construc¬ 
tion of non-atomic measures. 
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Definition 1.11 (//-negligible sets and //-almost everywhere). Given 
a measure space ( X , S, //), we say that B £ S is ii-negligible if //(B) = 
0, and we say that a property P(x) holds //-almost everywhere if the set 

{x £ X : P(x) is false} 

is contained in a //-negligible set. 

Notice that the class of //-negligible sets is stable under finite or count¬ 
able unions. It is sometimes convenient to know that any subset of a 
//-negligible set is still //-negligible. 

Definition 1.12 (//-completion of an-algebra and //-measurable sets). 

Let (X, S’, //) be a measure space. We define 

S jX \= {A e @>{X) : for some B, C eS with //(C) = 0, AAB c C}. 

It is easy to check that S is still a a -algebra, the so-called completion of 
S with respect to //. The elements of (£ are called //-measurable sets. 

It is also easy to check that // can be extended to all A £ S fl simply 
by setting //(A) = //(B), where B £ S is any set such that AAB is 
contained in a //-negligible set of S. This extension is well defined (i.e. 
independent of the choice of B), still cr-additive and //-negligible sets 
coincide with those sets that are contained in some B £ S with //(B) = 
0. As a consequence, any subset of a //-negligible set is //-negligible as 
well. 

1.5. The basic extension theorem 

The following result, due to Caratheodory, allows to extend a er-additive 
function on a ring g/ to a cr-additive function on o (.<//). It is one of the 
basic tools in the construction of non-trivial measures in many cases of 
interest, as we will see. 

Theorem 1.13 (Caratheodory). Let stf c £? (X) be a ring, and let S 
be the o-algebra generated by ,c/. Let //: g/ —> [0, +oo] be a-additive. 
Then // can be extended to a measure on S’. If p is a-finite, i.e. there 
exist A n £ .e/ with A n f X and fi(A n ) < oo for any n, then the extension 
is unique. 

To prove this theorem we need some preliminaries: for the uniqueness 
the Dynkin theorem and for the existence the concepts of outer measure 
and additive set. 
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1.5.1. Dynkin systems 

A non-empty subset X of X (A) is called a n-system if 
A, B e X =*► A(~\ B e X. 


A non-empty subset of X (A) is called a Dynkin system if 

(i) A,06®; 

(ii) Aef ==>• A c e 

(iii) (A,) c 3) mutually disjoint => [J ( A,- e <3). 

Obviously any a-algebra is a Dynkin system. Moreover, if .'A is both a 
Dynkin system and a n --system then it is a cr-algebra. In fact, if (A,) is 
a sequence in 3 of not necessarily disjoint sets we have 

OO 

[_J A,- = A 0 U (Ai \ A 0 ) U ((A 2 \ Ai) \ A 0 ) U ■ ■ ■ 

/=o 


and so (J ; G ^ by (ii) and (iii). 

Let us prove now the following important result. 

Theorem 1.14 (Dynkin). Let X be a n -system and let 3 D X be a 
Dynkin system. Then a(X) C 3. 

Proof. Let 3o be the minimal Dynkin system including X. We are going 
to show that 3 o is a a-algebra which will prove the theorem. For this 
it is enough to show, as remarked before, that the following implication 
holds: 

A, B e 3 0 =^ An B e 3 0 . (1.5) 

For any B e 3o we set 


X(B) = {F e3 o : B n F e 3 0 }. 


We claim that XX B ) is a Dynkin system. In fact properties (i) and (iii) 
are clear. It remains to show that if F fl B e 3o then F c n B e 3o or, 
equivalently, F U B c e 3 q. In fact, since F U B c — (F \ B c ) U B c = 
(FnB)U B c and F fl B and B c are disjoint, we have that F U B c e 3o 
as required. 

Notice first that if K e X we have X C X{K) since X is a 
7T-system. Therefore XX {K ) = 3o, by the minimality of 3o. Con¬ 
sequently, the following implication holds 


K e X, B e 3 0 


KnB e 3 0 , 
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which implies XXL c M’(B) for all B e XXX q. Again, the fact that -XX J ( B) is 
a Dynkin system and the minimality of °X{) give that -XXX ( B ) = S> 0 for all 
B e ® Q . By the dehnition of JXf(B), this proves (1.5). □ 

The uniqueness part in Caratheodory’s theorem is a direct consequence 
of the following coincidence criterion for measures; in turn, the proof of 
the criterion relies on Theorem 1.14. 

Proposition 1.15 (Coincidence criterion). Let /xj, /x 2 be measures in 
(A, S) and assume that: 

(i) the coincidence set 

S> := {A e S : /x^A ) = /x 2 (A)} 

contains a n-system -XT with o(-XS) = S; 

(ii) there exists a nondecreasing sequence (X,-) C -XX with /x i(A,) = 
/x 2 (Xj) < oo and X; f X. 

Then /x t = /x 2 . 

Proof. We first assume that /xi(A) = /x 2 (A) is hnite. Under this as¬ 
sumption LX is a Dynkin system including the 7r-system -XT (stability of 
LX under complement is ensured precisely by the finiteness assumption). 
Thus, by the Dynkin theorem, LX — S’, which implies that = /x 2 . 

Assume now that we are in the general case and let X ,• be given by 
assumption (ii). Fix i e N and define the a-algebra of subsets of X t 
by 

St := {A c Xj : A e S}. 

We may obviously consider /xj and /x 2 as hnite measures in the measur¬ 
able space (Xj, Si). Since these measures coincide on the 7T-system 

Jfi := (A c Xj : Ael} 

we obtain, by the previous step, that /X| and /x 2 coincide on a (.XX t ) C 

■M(Xi). 

Now, let us prove the inclusion 

{B eS : B c X,j C or(AQ. (1.6) 

Indeed 

(Bex : Bn Xj e a{XXTj)) 

is a a -algebra containing -XT (here we use the fact that Xj e ■XX'), and 
therefore contains S . Hence any element of S contained in Xj belongs to 

or(jQ. 

By (1.6) we obtain /xi(B n Xj) = /x 2 (B n Xj) for all B e S and all 
i e N. Passing to the limit as i —> oo, since B is arbitrary we obtain that 
/x i = /x 2 . □ 
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1.5.2. The outer measure 

Let /x be a set function defined on si C EX (EX'). For any E e EX(X) we 
define: 


li*(E) := inf \ ^/x(A,-) : A,- e si, E c [J A,- 

[ i=0 i= 0 

/x* is called the outer measure induced by /x. We can easily show that 
/x* is a nondecreasing set function, namely /x*(£) < n*(F) whenever 
E c F cX. 

We will obtain the proof of the existence part of Caratheodory’s the¬ 
orem by showing in the proposition below that /x* extends /x if /x is a- 
subadditive, and that (Theorem 1.17) /x* is cr-additive on a a-algebra 
containing o (si) if /x is si is a ring and /x is additive on si. In particular 
if /x is cr-additive on .s/ we see that /x* provides the desired cr-additive 
extension to cr(.g/)- 

Proposition 1.16. The set function /x* A a -subadditive on EX (X ) and 
extends /x /J/x is o-subadditive on si and /x (0) = 0. 

Proof. Let (Ef) c EX (X) and set E := E t . Assume that /x*(£,) 
are finite (otherwise the assertion is trivial). Then, since /x*(£,) is finite 
for any i e N, for any e > 0 there exist A, j e si such that 

oo £ °° 

n(Aij) < /x*(£,) + —, Ei c (J Aij, i e N. 

i=o Z l=o 

Consequently 

OO OO 

X>(A;,;)<Xy (^/) + e. 

i , 1=0 i '=0 

oo 

Since £ c U ^xi we have 

>M=o 


OO OO 

M*(£) < £ n(Aij) < ^ 

i,l=0 i'=0 

and the first part of the statement follows from the arbitrariness of e. 

Now, let us assume that /x is a -subadditive on si and choose E e si \ 
since E C 1J/ A' then /x(£) < /x(A,), so we deduce /x*(£) > n(E)\ 

but, by choosing A 0 = E and A„ = 0 for n > 1, we obtain that /x*(£) = 
/x(£). This proves that /x* extends /x. O 
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Let us now define the additive sets , according to Caratheodory. A set 
(A) is called additive if 

/i*(E) = /x*(£Tl A) + /x*(£Tl A c ) for all E e (1.7) 

We denote by the family of all additive sets. 

Notice that, since /x* is subadditive, (1.7) is equivalent to 

I 1 *(E) >/ 1 *(E n A) +/ 1 *(E n A c ) for all E e (1.8) 

Obviously, the class Sf of additive sets is stable under complement; 
moreover, by taking E = A U B with A e and A D B — 0, we 
obtain the additivity property 

/z*(AU5) = /z*(A) + /z*(5). (1.9) 

Other important properties of Sf are listed in the next proposition. 

Theorem 1.17. Assume that .?/ is a ring and that n is additive. Then 
is a a-algebra containing srf and /i* is a-additive on . 

Proof. We proceed in three steps: we show that contains .ft/, that is a 
a-algebra and that /i* is additive on r A. As pointed in Remark 1.5, if ji* 
is a-subadditive and additive on the a-algebra f#, then /x* is a-additive. 
Step 1. srf c *3. Let A e and E e ^(A), we have to show (1.8). 
Assume /i*(E ) < oo (otherwise (1.8) trivially holds), fix e > 0 and 
choose (B,-) C s# such that 

OO OO 

Ec\jBi, n*{E) + £> Y J B(B i ). 

i =0 i =0 

Then, by the definition of /z*, it follows that 

OO OO 

/i*(E) + e > B(Bi) = YfriBi n A) + /x(S, n A c )] 

/=0 i=0 

> ii*(E n A) + /i*(E n a c ). 

Since s is arbitrary we have fi*(E) > fi*(EDA) + fi*(E Pi A c ), and (1.8) 
follows. 

Step 2. <3 is an algebra and /i* is additive on 3. We already know 
that A e 3 implies A c e 3. Let us prove now that if A, B e 3 then 
A U B e 3. For any E e & (A) we have 

H*(E) = I 1 *(E n A) + I 1 *(E n A c ) 

= fi*(E n A) + /i*(E n a c n B) + /i*(E n a c n b c ) (l.io) 


= n A) + n*(E DA C D B)] + ti*{E n (A U B) c ). 
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Since 

(E n A) u (E n A f n B) = e n (A u B), 
we have by the subadditivity of n*, 

H*(E n A) + ii*(E DA c nB)> n*(E n(AU B)). 


So, by (1.10) it follows that 

/i*(E ) > /x*(£ n (A U B)) + fi*(E n (A U B) c ), 


and A U B e as required. The additivity of n* on follows directly 
from (1.9). 

Step 3. <£ is a <r-algebra. Let (A„) C . We are going to show that 
S := (J (j A n e ( /L Since we know that ^ is an algebra, it is not restrictive 
to assume that all sets A„ are mutually disjoint. Set S„ := (Jq A,-, for 
n e N. 

For any n e N, by using the cr-subadditivity of n* and by applying 
(1.7) repeatedly, we get 


n*(E n s c ) + ii*(e n s) < /x*(£ ns c ) + ^ n*(E n a,-) 


i=0 


= lim 

n —>oo 




(E n s c ) + ^ n*{E n a,) 


;= 0 


= lim [ n*(E n S c ) + n*(E n S„)l - 

n—>oo L J 

Since S c C S‘ n it follows that 

fi*(E n S c ) + II*(E n S) < lim sup [//*(£’ n S„) + fi*{E n S c n )] 


n—>oo 

= H*(E). 


So, and ^ is a a -algebra. □ 

Remark 1.18. We have proved that 


a{sf) C <3 C 


( 1 . 11 ) 


One can show that the inclusions above are strict in general, for instance 
when /x is the Lebesgue measure we shall consider in the next section. 
In fact, in the case when X = M and cr(g/) is the Borel a- algebra, Exer¬ 
cise 1.19 shows that o{sf) has the cardinality of continuum, while has 
the cardinality of <^(R), since it contains all subsets of Cantor’s middle 
third set (see Exercise 1.8). An example of a non-additive set will be built 
in Remark 1.23, so that also the second inclusion in (1.11) is strict. 
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1.6. The Lebesgue measure in R. 

In this section we build the Lebesgue measure on the real line M. To this 
aim, we consider first the set JA of all bounded right open intervals of M 

JA := {(a, b ] : a, b e M, a < b } 


and the collection sA containing 0 and the finite unions of elements of 
<0. Our choice of half-open intervals ensures that sA is a ring, because 
■A is stable under intersection and relative complement (the families of 
open and closed intervals, instead, do not have this property). 

We define 

length ((a, b ]) b — a. 

More generally, any non-empty A e sA can be written, possibly in many 
ways, as a disjoint finite union of intervals /, , i — 1, ..., IV; we define 

N 

HA) :=£]length(/,). (1.12) 

i=l 

Setting A(0) = 0, it is not hard to show by elementary methods that A 
is well defined (i.e. A (A) does not depend on the chosen decomposition) 
and additive on sA. 

In the next definition we introduce the notion of characteristic function, 
which can be used to turn set-theoretic operations into algebraic ones: 
for instance the intersection corresponds to the product, when seen at the 
level of characteristic functions (see also Exercise 1.1). 

Definition 1.19 (Characteristic function of a set). Let A c X. The 

characteristic function fl A : AT —^ {0, 1} is defined by 


1a CO 


1 ifx e A; 

0 if x e X \ A. 


The reader already acquainted with Riemann’s theory of integration can 
also notice that A (A) is the Riemann integral of the characteristic function 
11 A of A, and deduce the additivity property of A directly by the additivity 
properties of the Riemann integral. In the next theorem we shall rigor¬ 
ously prove these facts, and more. We first state an auxiliary lemma, a 
simple consequence of the Bolzano-Weierstrass compactness theorem on 
the real line. 


Lemma 1.20. Any bounded and closed interval J contained in the union 
of a sequence {A„}„ € n of open sets is contained in the union of finitely 
many of them. 
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Proof. Assume with no loss of generality that I — N and A n C A n+ \, 
and assume by contradiction that there exist x n e J \ A n for all n ; by the 
Bolzano-Weierstrass theorem there exists a subsequence (x n ^)) conver¬ 
ging to some x e J. If h is such that x e A k , for k large enough x n(k ) 
belongs to A„, because A,-, is open. But this is not possible, as soon as 
n(k) > h, because x n(k) (/ A n(k) and A n(k) D A„. □ 

Theorem 1.21. The set function X defined in (1.12) is a -additive on ,<//. 

Proof. (A is well defined) Given disjoint partitions I \and J\,... 

..., J m of A e .(/, we say that J\ . J m is finer than if any 

interval 7, is the disjoint union of some of the intervals Jj. Obviously, 
given any two partitions, there exists a third partition finer than both: it 
suffices to take all intersections of elements of the first partition with ele¬ 
ments of the second partition, neglecting the empty intersections. Given 
these remarks, to show that A is well defined, it suffices to show that 
J2i k(/,) = J2j k(Jj) if J\, ..., J m is finer than I\,, /„. This state¬ 
ment reduces to the fact that A(7) = A( F k ) if I e J? is the disjoint 
union of some elements F k e . A; this last statement can be easily proved, 
starting from the identity ( a , h ] = {a, c] U (c, b], by induction on the 
number of the intervals F k . 

(A is additive) If F, G e and F GG = 0, any disjoint decompositions 

of F in intervals 7)./„ e / and any disjoint decomposition of G in 

intervals J\, ..., J m e provide a decomposition 
of F U G in intervals belonging to J?. Using this decomposition to com¬ 
pute A (F U G) the additivity easily follows. 

(A is cr-additive) Let ( F „) c -s/ be a sequence of disjoints sets in s4 and 
assume that 


OO 



(1.13) 


also belongs to srf. 

We prove the additivity property first in the case when F = (x, y] e 
JT. It is also not restrictive to assume that the series A(F n ) is con¬ 

vergent. As any F„ is a finite union of intervals, say N n , we can find, 
given any e > 0, a finite union F' n D F„ of intervals in such that 
A(F ( ') < A( F n ) + s/2" and the internal part of F' n contains F n (just shift 
the endpoints of each interval in F n by a small amount, to obtain a lar¬ 
ger interval in increasing the length at most by e/(N n 2")). Let also 
F'/ be the internal part of Ffi that still includes F n , and let x e (x, y]. 
Then, since [x', y | c F", Lemma 1.20 provides an integer k such 
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that [x', y\ c [Jo F' n . Hence, the additivity of X in . 0 / gives 

\n=0 / n =0 

k £ °° 

< X(F„) + — < 2e + X(F n ). 

n=0 n =0 

By letting first s | 0 and then letting x' j, x we obtain that X(F) < 
JJJ X(F,i). The opposite inequality simply follows by the monotonicity 
and the additivity of X, because the finite unions of the sets F„ are con¬ 
tained in F. 

In the general case, let 

r = LK 

i=l 

where I\, ..., /* are disjoint sets in . Then, since for any i e {1, ..., k] 
we have that /, is the disjoint union of /,- fl F n , we know by the previous 
step that 

OO 

X(I i ) = Y J Wi FF n ). 

n =0 

Adding these identities for i = 1 ,,k, commuting the sums on the 
right hand side and eventually using the additivity of X on stf we obtain 

k 00 k 00 

X(F) = E x(ii n F) = EE x(ii n F n ) = □ 

i =1 n =0 i=l n =0 

We say that a measure /i in (M, Sfi (1R)) is translation invariant if /z(A + 
h) = 11 (A) for all A e fkS (M) and h e 1R (notice that, by Exercise 1.2, 
the class of Borel sets is translation invariant as well). We say also that /z 
is locally finite if /z(7) <00 for all bounded intervals / cM. 

Theorem 1.22 (Lebesgue measure in M). There exists a unique, up 
to multiplication with constants, translation invariant and locally fi¬ 
nite measure X in (K, -AS (M)). The unique such measure X satisfying 
/,([(), 1]) = 1 is called Lebesgue measure. 

Proof. (Existence) Let stf be the class of finite unions of intervals and 
let A. : .</X —> [0, + 00 ) be the a -additive set function defined in (1.12). 
According to Theorem 1.21 A admits a unique extension, that we still 
denote by X, to a (&/) = £$ (R). Clearly X is locally finite, and we can use 
the uniqueness of the extension to prove translation invariance: indeed, 
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for any h e R also the ct- additive measure A i—► X(A + h) is an extension 
of X\^. As a consequence k(A) = X(A + h ) for all h e R. 

(Uniqueness) Let v be a translation invariant and locally finite measure 
in (R, SS (R)) and set c := v([0. 1]). Notice first that the set of atoms of 
v is at most countable (Exercise 1.5), and since R is uncountable there 
exists at least one x such that v({x}) = 0. By translation invariance this 
holds for all x, i.e., v has no atom. 

Excluding the trivial case c = 0 (that gives v = 0 by translation in¬ 
variance and CT-additivity), we are going to show that v = cX on the 
class of finite unions of intervals; by the uniqueness of the extension 
in Caratheodory theorem this would imply that v = cX on (R). 

By finite additivity and translation invariance it suffices to show that 
v([0, t)) = ct for any t > 0 (by the absence of atoms the same holds for 
the intervals (0, t), (0, t\, [0, 1 1). Notice first that, for any integer q > 1, 
[0, i) is the union of q disjoint intervals all congruent to [0, 1 /q)\ as a 
consequence, additivity and translation invariance give 


v([0, 1 /q)) 


v([0, D) 
q 


c 

q 


Similarly, for any integer p > 1 the interval [0, p/q) is the union of p 
disjoint intervals all congruent to [0, 1 /q)\ again additivity and transla¬ 
tion invariance give 


v([0, -)) = pv([0, -)) = c-. 

q q q 

By approximation we eventually obtain that v([0, /)) = ct for all 
t > 0. □ 

The completion of the Borel o-algebra with respect to X is the so- 
called a -algebra of Lebesgue measurable sets. It coincides with the 
class ^ of additive sets with respect to X* considered in the proof of 
Caratheodory theorem (see Exercise 1.12). 

Remark 1.23 (Outer Lebesgue measure and non-measurable sets). 

The measure X* used in the proof of Caratheodory’s theorem is also called 
outer Lebesgue measure, and it is defined on all parts of M. The termin¬ 
ology is slightly misleading here, since X *, though ct- subadditive, fails 
to be cr-additive. In particular, there exist subsets of R that are not Le¬ 
besgue measurable. To see this, let us consider the equivalence relation 
in R defined byx~yifx — j e Q and let us pick a single element 
x e [0, 1] in any equivalence class induced by this relation, thus forming 
a set A c [0,1]. Were this set Lebesgue measurable, all the sets A + h 
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would still be measurable, by translation invariance, and the family of 
sets {A + h}i, e q would be a countable and measurable partition of R, 
with A* (A + h) — c independent of h e Q. Now, if c — 0 we reach a 
contradiction with the fact that A*(R) = oo, while if c > 0 we consider 
all sets A + h with h e Q n [0, 1] to obtain 

2 = A*([0, 2]) > A*(A + h) = oo, 

/;eQn[0,i] 


reaching again a contradiction. 

Notice that this example is not constructive and strongly requires the 
axiom of choice (also the arguments based on cardinality, see Exercise 
1.19 and Exercise 1.20, have this limitation). On the other hand, one 
can give constructive examples of Lebesgue measurable sets that are not 
Borel (see for instance 2.2.11 in [3]). 

The construction done in the previous remark rules out the existence of 
locally finite and translation invariant ct- additive measures defined on all 
parts of R. In R", with n > 3, the famous Banach-Tarski paradox (see 
for instance [6]) shows that it is also impossible to have a locally finite, 
invariant under rigid motions and finitely additive measure defined on all 
parts of R' 1 . 

1.7. Inner and outer regularity of measures on metric spaces 

Let ( E , d ) be a metric space and let /z be a finite measure on ( E , S3 ( E )). 
We shall prove a regularity property of /z. 

Proposition 1.24. For any B e S3 {E) we have 

p{B) = sup{/z(C) : C C B, closed} = inf{/z(A) : A D S, open}. 

(1.14) 


Proof. Let us set 


l-{fief(£): (1.14) holds}. 

It is enough to show that JF is a a-algebra of parts of E including the 
open sets of E. Obviously ^contains E and 0. Moreover, if B e S3S 
then its complement B c belongs to Let us prove now that ( B „) c JE 
implies [J n B n e JE. Lix e > 0. We are going to show that there exist a 
closed set C and an open set A such that 

OO 

C c\jB n c A, 

n=0 


/z(A\C) < 2s. 


(1.15) 
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Let n e N. Since B n e there exist an open set A„ and a closed set C„ 
such that C n C B n C A„ and 


M-(A„ \ C„) < 2«+i ■ 

Setting A := A„, S := (J n C„ we have S C U„ C A and /x(A \ 

S) < s. However, A is open but S is not necessarily closed. So, we 
approximate S by setting S„ := (Jo C*. The set 5„ is obviously closed, 
S„ f 5 1 and consequently /i(.S’„) t M(^)- Therefore there exists n e e N 
such that ii(S\S ne ) < e. Now, setting C = S n , we have C C U„ B n C A 
and /z(A \ C) < /x(A \ 5) + /x(5 \ C) < 2s. Therefore 1J„ B n e We 
have proved that is a cr-algebra. It remains to show that -'Xf contains 
the open subsets of E. In fact, let A be open and set 


where d(x, A c ) := inf veA ^ cl(x, y ) is the distance function from A 1 . Then 
C„ are closed subsets of A, and moreover C n f A, which implies /x(A \ 
C„) | 0. Thus the conclusion follows. □ 

Notice that inner and outer approximation hold for /x-measurable sets 
B as well: one has just to notice that there exist Borel sets B \, B 2 such 
that fij c B c B 2 with n (B 2 \ B t ) = 0. and apply inner approximation 
to B i and outer approximation to B 2 . 

Remark 1.25 (Inner and outer approximation for cr-finite measures). 

It is possible to extend the inner approximation property to cr-finite meas¬ 
ures: suffices to assume the existence of a sequence of closed sets C„ with 
finite measure such that fi(X\\J„C„) = 0. Indeed, assuming with no loss 
of generality that C„ C C n+ \, we know that for any Borel set B and any 
n e N it holds 

fi(B fl C n ) = sup{/x(C) : C closed, C C B D C n }, 


C n = \ x e E : d(x, A c ) 


so that 

/x(B n C n ) < sup {/x(C) : C closed, Cc8|. 

Letting nf oowe recover the inner approximation property. 
Analogously, if we assume the existence of a sequence of open sets A„ 
with finite measure satisfying X = U„ A„, we have the outer approxim¬ 
ation property: indeed, for any n and any c > Owe can find (assuming 
with no loss of generality /x(B) < +oo) open sets B n C A„ containing 
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B D A n and such that ii(B n \ (B fl A,,)) < e2 It follows that U n B n 
contains B and 



Since B n are also open in X, the set U„ B„ is open and since e is arbitrary 
we get the outer approximation property. 

We conclude this chapter with the following result, whose proof is a 
straightforward consequence of Proposition 1.24 (alternatively, one can 
use Dynkin’s argument, since the class of closed sets is a jt -system and 
generates the Borel a- algebra). 

Corollary 1,26. Let /x, v be finite measures in (E,8$ (E)), such that 
/x(C) = v{C) for any closed subset C of E. Then ji = v. 


Exercises 

1.1 Given A C Z, denote by f A : X —> {(), I j its characteristic function, equal 
to 1 on A and equal to 0 on A c . Show that 

1/tuB = max{l A , i B }, t Ar>B = m in{l A , 1 B }, 1 A c = l x —l A 


and that 


lim sup A„ = A 

n—>oo 


lim sup t A =t A , 

n—>oo 


lim inf A„ = A 

n->oo 


1.2 Let A C R" be a Borel set. Show that 


lim inf Ia =t A . 

n—> oo n 

for h e R" and t e R the sets 


A + h := {a + h : a e A}, t A {ta : a e A} 

are Borel as well. 

1.3 Find an example of a <7-additive measure /x on a ex-algebra sT such that 
there exist A„ e sT with A n \ A and inf,, /x(A„) > /x(A). 

1.4 Let /x be additive and finite, on an algebra sT. Show that /x is er-additive if 
and only if it is continuous along nonincreasing sequences. 

1.5 Let /x be a finite measure on (X, $). Show that the set of atoms of /x, defined 
by 

A m := {x e X : {.r} e £ and /x({x}) > 0} 

is at most countable. Show that the same is true for er-finite measures, and 
provide an example of a measure space for which this property fails. 

1.6 Let (Z, £, /x) be a measure space, with /x finite. We say that /x is diffuse if 
for all A e £ with /x(A) > 0 there exists B C A with 0 < /x(B) < /x(A). Show 
that, if /x is diffuse, then /x(£) = [0, /x(Z)]. 

1.7 Show that if X is a separable metric space and £ is the Borel cr-algebra, 
then a cr-additive measure /x : £ —*■ [0, +oo) is diffuse if and only if /x has no 
atom. 
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1.8 Let X be the Lebesgue measure in [0, 1], Show the existence of a /--negli¬ 
gible set having the cardinality of the continuum. Hint: consider the classical 
Cantor’s middle third set, obtained by removing the interval (1/3, 2/3) from 
[0, 1], then by removing the intervals (1/9, 2/9) and (7/9, 8/9), and so on. 

1.9 Let X be the Lebesgue measure in [0, 1]. Show the existence, for any e > 0, 
of a closed set C C [0, 1] containing no interval and such that X(C) > 1 — e. 
Hint: remove from [0, 1] a sequence of open intervals, centered on the rational 
points of [0, 1], 

1.10 Using the previous exercise, write [0, 1] = A U B where A is negligible 
in the measure-theoretic sense (i.e. X(A) = 0) and B is negligible in the Baire 
category sense (i.e. it is the union of countably many closed sets with empty 
interior). So, the two concepts of negligible should be never used at the same 
time. 

1.11 ★ Let X be the Lebesgue measure in [0, 1]. Construct a Borel set E C (0, 1) 
such that 

0 < X(EC\ I) < X(I) 
for any open interval I C (0. 1). 

1.12 Let (X, S, ii) be a measure space and let //* : &(X) -* [0, +oo] be 
the outer measure induced by //. Show that the completed a -algebra S jL is 
contained in the class 'tf of additive sets with respect to //*. 

1.13 Let (X, S, fj.) be a measure space and let //* : (X) —> [0, +oo] be the 

outer measure induced by //. Show that for all A C X there exists B e S 
containing A with fi(B) — ix*(A). 

1.14 Let (X, S, //) be a measure space. Check the following statements, made 
in Definition 1.12: 

(i) Sp is a a -algebra; 

(ii) the extension //(A) := /i(B), where B e S is any set such that A AB is 
contained in a //-negligible set of S, is well defined and cr-additive on 

(iii) //-negligible sets of S jL are characterized by the property of being coin- 
tained in a //-negligible set of S. 

1.15 ★ Let (X, S, //) be a measure space and let //* : A? (X) —> [0, +oo] be 
the outer measure induced by //. Show that if //(X) is finite, the class of 
additive sets with respect to //* coincides with the class of (^-measurable sets. 
Hint: one inclusion is provided by Exercise 1.12. For the other one, given an 
additive set A, by applying Exercise 1.13 twice, find first a set B e S with 
/jL*(B \ A) — 0, and then a set C e S with //(C) = 0 and B \ A C C. 

1.16 Find a a -algebra S C £? (N) containing infinitely many sets and such that 
any B e S different from 0 has an infinite cardinality. 

1.17 Find // : ^(N) —»• {0, +oo} that is additive, but not er-additive. 

1.18 ★ Let co be the first uncountable ordinal and, for .'Xf C fX (X), define by 

transfinite induction a family i e co, as follows: := JCU (0), 

OO 

(J A k , B c : (A k ) c B e ,? U) 

k=0 


J?(i) ;= 
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if i is the successor of j, and := U/s; ^ ^ otherwise. 

Show that U ;e(U = o(/XS). 

1.19 ★ Show that 58 (R) has the cardinality of the continuum. Hint: use the con¬ 
struction of the previous exercise, and the fact that co has at most the cardinality 
of continuum. 

1.20 ★ Show that the er-algebra S£ of Lebesgue measurable sets has the same 
cardinality of /S’OR), thus strictly greater than the continuum. Hint: consider all 
subsets of Cantor’s middle third set. 

1.21 ★ ★ Show that the cardinality of any a -algebra is either finite or uncount¬ 
able. 

1.22 Let X be a set and let sS C S3 (X) be an algebra with finite cardinality. 
Show that its cardinality is equal to 2 n for some integer n > 1 . 

1.23 ★ Let ( X , S’, n ) he a a measure space and suppose that X is finite or count¬ 
able. Show the existence of a measure jl on 'S’ (X) that extends /x, that is, 
/x(A) = tL(A) for all A e S. 

1.24 ★ ★ Find an example of an additive set function /x : /S’ (N) —> {0, 1}, with 
/x(N) = 1 and /x({n}) = 0 for all n e N (in particular /x is not a -additive, the 
construction of this example requires Zorn’s lemma). 

1.25 ★ Let C e SS ([(), 1]) with /AC) > 0. Without using the continuum hypo¬ 
thesis, show that C has the cardinality of continuum. 

1.26 ★ Let ( K , cl) be a compact metric space and let /x be as in Exercise 1.24. 
Let’s say that a sequence (x n ) C K /x-converges to x e K if 

ti{[n e N : d(x n ,x) > e}) = 0 Ve > 0. 

Show that any sequence (x „) C K is /x-convergent and that the /x.-limit is 
unique. 


Chapter 2 
Integration 


This chapter is devoted to the construction of the integral of &■ -measur¬ 
able functions in general measure spaces (Q, S, /i), and its main con¬ 
tinuity and lower semicontinuity properties. Having built in the previous 
chapter the Lebesgue measure in the real line M, we obtain as a byproduct 
the Lebesgue integral on M; in the last section we compare Lebesgue and 
Riemann integral. 

In the construction of the integral we prefer to empahsize two view¬ 
points: the first, more traditional one 

f fdpi = ^2 = z}) 

Jx zelm(/) 

is appropriate to deal with simple functions (i.e. functions whose range is 
finite) and useful to show the additivity of the integral with respect to /. 
The second one, for nonnegative functions is summarized by the formula 

[ fdfi = [ /z({/ > t})dt. 

Jx Jo 

This second viewpoint is more appropriate to show the continuity prop¬ 
erties of the integral with respect to / (the integral on the right side can 
be elementarily defined, since t \-* /z({/ > t }) is nonincreasing, see 
Section 2.4.3). Of course we show that the two viewpoints are consistent 
if we restrict ourselves to the class of simple functions. 

2.1. Inverse image of a function 

Let X be a non empty set. For any function <p: X —»■ Y and any I e 
8P (Y) we set 

c p~ x {l ) := {x e X : cp(x) e 1} = {(p e /}. 

The set (p~ l (I) is called the inverse image of I. 
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Let us recall some elementary properties of (p 1 (the easy proofs are left 
to the reader as an exercise): 

(i) (p~\l c ) = (<p _1 (/)) c for all I e &(Yy, 

(ii) if {Ji}iei C 3^(Y) we have 

u=^“‘(U *)’ rv iw) = 

iel iel iel iel 

In particular, if I n / = 0 we have cp~ l (I) fl ip~ l (J ) = 0. Also, if 
S C fS (Y) and we consider the family <p~ l (S) of subset of X defined 
by 

<p-\g):={<p-\l): leg), (2.1) 

we have that <p~ l (S) is a er-algebra whenever A is a a-algebra. 

2.2. Measurable and Borel functions 

We are given measurable spaces (X, S) and (Y, SP). We say that a func¬ 
tion <p\ X — > Y is (S, &(-measurable if <p~ { (TT) C S. If (Y, J£") = 
(K, 38 (M)), we say that ip is a real valued S-measurable function, and if 
(A, d) is a metric space and S is the Borel a -algebra, we say that (p is a 
real valued Borel function. 

The following simple but useful proposition shows that the measurab¬ 
ility condition needs to be checked only on a class of generators. 

Proposition 2.1 (Measurability criterion). Let <£ c & be such that 
cs (S) — SF. Then <p: X —> Y is (S , &)—.measurable if and only if 
(p~ l (I) e Sfor all 1 e (equivalently, iffip~ l ((S) C S). 

Proof. Consider the family := {/ e & : (/) e Sj. By the 

above-mentioned properties of <p~ l as an operator between fS ( Y) and 
TP(X), it follows that is a a-algebra including S. So, it coincides 
with a(fS) = □ 

A simple consequence of the previous proposition is the fact that any 
continuous function is a Borel function: more precisely, assume that ip : 
X —»■ Y is continuous and that S — 38 (X) and ST = 38 (Y). Then, the 
a -algebra 

{AcT: <p~ l (A) e 38(X)) 

contains the open subsets of T (as, by the continuity of <p, <p _ 1 (A) is 
open in X , and in particular Borel, whenever A is open in T), and then it 
contains the generated o -algebra, i.e. 38 (T). 

The following proposition, whose proof is straightforward, shows that 
the class of measurable functions is stable under composition. 
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Proposition 2.2. Let (X, S), (Y, XP), (Z, XI) be measurable spaces and 
let (p : X Y and f : Y Z be respectively ( S , XX)-measurable and 
(■XX, X?)-measurable. Then \jr o <p is (S, X?)—measurable. 

It is often convenient to consider functions with values in the extended 
space R := R U {+oo, —oo}, the so-called extended functions. We say 
that a mapping ip : X —> R is ^-measurable if 

oo}), ^ -1 ({+oo}) e S and ip~ l (I)eS, V7e^(M). 

( 2 . 2 ) 

This condition can also be interpreted in terms of measurability between 
S and a suitable Borel a-algebra in R, see Exercise 2.3. Analogously, 
when (A, d) is a metric space and S is the Borel a -algebra, we say that 
(p: X —► R is Borel whenever the conditions above hold. 

The following proposition shows that extended ^-measurable func¬ 
tions are stable under pointwise limits and countable supremum and in- 
fimum. 

Proposition 2.3. Let ( cp n ) be a sequence of extended S-measurable func¬ 
tions. Then the following functions are S-measurable: 

sup (p„(x), inf <pi,(x), lim sup ip n (x), liminf^„(x). 

nsN neN n —»oo n—>oo 

Proof. Let us prove that <p(x) := sup,, (p„(x) is (^-measurable (all other 
cases can be deduced from this one, or directly proved by similar argu¬ 
ments). For any a £ R we have 

OO 

<p _1 ([—oo,a]) = Pl^O-oc^a]) £ S. 

n =0 

In particular {ip = —oo} £ S, so that ip~ [ ((— oo, a]) £ S for all a £ R; 
by letting a f oo we get <p -1 (R) e S. Asa consequence, the class 

{/ £ ^(R) : (p~\l) £ S) 

is a cr-algebra containing the intervals of the form (—oo, a] with a £ R, 
and therefore coincides with XS (R) . Eventually, {ip = +oo} = A \ 
[<p _1 (R) U {ip = —oo}] belongs to S as well. □ 

2.3. Partitions and simple functions 

Let (A, S) be a measurable space. A function ip : A —»■ R is said to be 
simple if its range ip( A) is a finite set. The class of simple functions is 
obviously a real vector space, as the range of ip + f is contained in 


{a + b : a £ range(^), b £ ranged)} • 
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If ( p(X ) = a n }, with at cij if i / j, setting A,- = q> 1 ({«/}), 

i = I. n we can canonically represent <p as 

n 

<p{x) = re! (2.3) 

jt=i 

Moreover, Ai,, A n is & finite partition of X (i.e. A ,• are mutually 
disjoint and their union is equal to X). However, a simple function p has 
many representations of the form 

N 

<P(x) = xeX, 

k=\ 

where A\,..., A' N need not be mutually disjoint and a' k need not be in 
the range of <p. For instance 


%U) + %1,2] — %),2] + 21[1,2]- 


It is easy to check that a simple function is (^-measurable if, and only if, 
all level sets A k in (2.3) are ^-measurable; in this case we shall also say 
that {Ayt} is a fin ite S’—measurable partition of X. 

Now we show that any nonnegative (^-measurable function can be ap¬ 
proximated by simple functions; a variant of this result, with a different 
construction, is proposed in Exercise 2.8. 

Proposition 2.4. Let (p be a nonnegative extended $—measurable func¬ 
tion. For any n e N*, define 


< Pnix ) = 


i — 1 i — 1 i 

- if - < (p{x) < — , i = 1, 2, ..., n2 ; 

2 " 2 " — 2 " 


if i p(x) > n. 


(2.4) 


Then (p„ are simple and S’—measurable, {(p n ) is nondecreasing and con¬ 
vergent to <p. If in addition tp is bounded the convergence is uniform. 


Proof. It is not difficult to check that (<p„) is nondecreasing. Moreover, 
we have 


1 

0 < <p(x) — (p n (,x) < — if <p(x) < n, x e X, 
and 

0 < (p(x) — (p n (x) = cp(x) — n if cp(x ) > n, x G X. 
So, the conclusion easily follows. 


□ 
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2.4. Integral of a nonnegative ^-measurable function 

We are given a measure space (X, S, /i). We start to define the integral 
for simple nonnegative functions. 

2.4.1. Integral of simple functions 

Let ip be a nonnegative simple (^-measurable function, and let us repres¬ 
ent it as 

N 

<p(x) = y^a k t Alc , xeX, 

k=l 

with N € N, a\, ..., a N > 0 and A\, ..., A N in S. Then we define 
(using the standard convention in the theory of integration that 0-oo = 0), 

f N 

/ ip d/i := y ^a k ii(A k ). 

k= 1 

It is easy to see that the definition does not depend on the choice of the 
representation formula for ip. Indeed, let [h|..... b M ] be the range of ip 
and let ip = Y^ with Bj := (p~ l (bj), be the canonical representa¬ 
tion of ip. We have to prove that 

N M 

^2 a kH(A k ) = ^bjn(Bj). (2.5) 

k= 1 j= 1 

As the B; ’s are pairwise disjoint, (2.5) follows by adding the M identities 

N 

Y2a k /i(A k fi Bj) = bjii(Bj) j = 1,..., M. (2.6) 

*=] 

In order to show (2.6) we fix j and consider, for / C {1,..., IV}, the sets 

A/ := [x e Bj : x e A, iff i e /}, 

so that {A/} are a (^-measurable partition of Bj and x € A/ iff the set 
of i ’s for which x € A, coincides with I. Then, using first the fact that 
A/ c A, if / e /, and A, fl A/ = 0 otherwise, and then the fact that 
Y a k — bj whenever A/ ^ 0 (because Y\ a ^A t coincides with bj, the 

kel 

constant value of ip on Bj), we have 

N N N 

J2 a kiMA k nBj) = a k ii(A k n A/) = EE a k /i(A k n A/) 

k= 1 k= 1 I 1 k= 1 

= EE flA-/x(A/) = y^bjuiA!) = bjii(Bj). 

I kel I 


28 Luigi Ambrosio, Giuseppe Da Prato and Andrea Mennucci 


Proposition 2.5. Let ip, \[r be simple nonnegative S-measurable func¬ 
tions on X and let a, fi > 0. Then oup + fhj/ is simple, S-measurable 
and we have 


/ (cup + ff) dp. = a / (pd/i + P / xj/dp, 

Jx Jx Jx 


Proof. Let 

n m 

(p = ^2 a k l Ak , f = b h l B h 
k =1 h= 1 

with {A k },{Bh} finite -measurable partitions of X. Then { A k n B h } is a 
finite ^-measurable partition of X and a(p + ( J nj/ is constant (and equal 
to aa k + fib/,) on any element A k fl B h of the partition. Therefore the 
level sets of oup + ft f are finite unions of elements of this partition and 
the (^-measurability of cup + fh/r follows (see also Exercise 2.2). Then, 
writing 


tp(x) = EE a ^ At nB h (*)> f 0 ) = EE b h^A k nB h (x), xeX, 

k= 1 h =1 ifc=l h= 1 

we arrive at the conclusion. □ 

2.4.2. The repartition function 

Let ip : X —> M be S --measurable . The repartition function F of <p, relat¬ 
ive to /x, is defined by 

Fit) := pt({tp > f}), t e M. 

The function F is nonincreasing and satishes 

lim F(t) = lim F(n) = lim pt({ip > — n}) = /z({<p > — oo}), 

t —>■—oo n—>—o o n—>oo 

and, if // is hnite, 

lim F(t) = lim F(n) = lim pi({ip > n}) = pi({ip = +oo}), 

f—>+oo n—> oo n—>oo 

since 


oo oo 

{(P > -oo} = |J{<p > —ri), {(p = Too} = P){^ > n). 

n =1 n =1 


Other important properties of F are provided by the following result. 
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Proposition 2.6. Let cp: X —> M be S’-measurable and let F be its re¬ 
partition function. 

(i) For any to £ M we have lim F(t) = F(to), that is, F is right con- 

i^t+ 

tinuous. 

(ii) If p, is finite, for any to e M we have lim F(t) = /x({<p > to)}, that 

r->r 0 

is, F has left limits (1 
Proof. Let us prove (i). We have 

lim F(t) = lim F (to H— ) — lim /x ( I (p > to + - 1 ) 

t-*tf n^+oo \ n) n^+oo y [ H J / 

= > to}) = F(to), 


since 


W > t 0 } = I J I <p > to + - 1 = lim 
, \ n \ n^roc 

n= 1 1 ’ 

So, (i) follows. We prove now (ii). We have 


lim F(t) = lim F I f 0 - 

t-HZ n^+oo V n 


<P > to + - } ■ 

oo i n 


since 


= lim /x {<p > to - - = p.{{<p > t 0 }), 

n—t+oo \ | n 


{<P > to} = P) I (p > to - - } = lim I cp > to - - j 

I n J n^o o l n J 


and (ii) follows. 


□ 


From Proposition 2.6 it follows that, in the case when pc is finite, F is 
continuous at t 0 iff pt({ip = to}) = 0. 

Now we want to extend the integral operator to nonnegative (^-mea¬ 
surable functions. Let cp be a nonnegative, simple and (^-measurable 
function and let 

n 

<p(x) = ^a k l Ak , X e X, 

k=0 


^ In the literature F is called a cadlag function. 
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with n e N*, 0 = a 0 < «i < a 2 < ■ ■ ■ < a n < oo. Then the repartition 
function F of (p is given by 


F(t) = 


H(Ax) + fi(A 2 ) H-b /x(A„) = F( 0) if 0 < t < a { 

/u(A 2 ) + /i(A 3 ) H-b /x(A„) = F(a0 if a x < t < a 2 


H(A n ) = F(a n - 1 ) if a n -i < t < a n 

0 = F(a n ) if t >a n . 


Consequently, we can write 


n n n 

/ c p(x)dn(x ) = ^a k iJ,(A k ) = ^a k (F(a k -i) - F(a k )) 

k= 1 *=i 

ft ft 

= y'a k F{a k -x) - y" y a k F(a k ) 

k= 1 A:=l 

ft — 1 ft— 1 

= F(a k ) - y^a k F(a k ) 

k =0 £=0 

ft— 1 ,oo 

= VVar+i - a k )F(a k ) = / F(t)dt. 

k =o 7o 


(2.7) 


Example 2.7. We set X = M, /z = A, 

A, = [1,2] U [10, 11], A 2 = [2, 3], A 3 = [3,4], 

A 4 = [4, 6], A 5 = [7, 10], 

= 5, a 2 =, a-i = 10, a 4 = 7, as = 2 

and (p := Yll=\ a k^A k t0 be the simple function shown in Figure 2.1. It is 
easy to verify that F has the graph shown in the right picture in Figure 2. 1 . 
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Figure 2.1. a simple function (p , and its repartition F 
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The color scheme used for the areas below the two graphs in 2.1 proves 
graphically that the areas are identical. 

Now, we want to define the integral of any nonnegative extended §- 
measurable function by generalizing formula (2.7). For this, we need 
first to define the integral of any nonnegative nonincreasing function in 
(0, Too). 

2.4.3. The archimedean integral 

We generalize here the (inner) Riemann integral to any nonincreasing 
function / : [0, Too) —> [0, Too], The strategy is to consider the su- 
premum of the areas of piecewise constant minorants of /. 

Let E be the set of all finite decompositions a = {/],..., ty} of 
[0, Too], where N eW and 0 = to < t\ < ■ ■ ■ < W < Too. 

Let now /: [0, Too) —> [0, Too] be a nonincreasing function. For 
any a = {to, t\, ... , ty} e E we consider the partial sum 


N -1 


If(°) '■= ^ /(4+i)(4+i — tk). 

k=0 


( 2 . 8 ) 


We define 

f(t)dt := sup{/f(er) : a £ E}. (2.9) 

The integral / 0 °° fit) dt is called the archimedean integral of /. It enjoys 
the usual properties of the Riemann integral (see Exercise 2.5) but, among 
these, we will need only the monotonicity with respect to / in the sequel. 
For our purposes the most relevant property of the Archimedean integral 
is instead the continuity under monotonically nondecreasing sequences. 


Proposition 2.8. Let f n f f, with f„ : [0, Too) 
creasing. Then 



( t ) dt \ 



[0, Too] nonin- 


Proof. It is obvious that 


/»oo n OO 

/ fn(t)dt < / f{t)dt. 

Jo Jo 


To prove the converse inequality, fix L < / 0 °° f(t)dt. Then there exists 
o = [t\, ..., tff} £ E such that 


N -1 

/(4) (4+i -4) > L. 

k=0 
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Since for n large enough 


N-\ 


0 k=0 


fn(t)dt > E fi(tk+ i)(4+i — 4) > re¬ 


letting n ^ co we find that 


CO 


sup fn (t ) dt > L. 


;ieN JO 


This implies 


sup / f n (t)dr > I f (t) dt 



and the conclusion follows. 


□ 


2.4.4. Integral of a nonnegative measurable function 

We are given a measure space ( X , S’, /z) and an extended nonnegative 
(^-measurable function cp. Having the identity (2.7) in mind, we define 



( 2 . 10 ) 


Notice that the function t fi({(p > t\) e [0, +oo] is nonnegative and 
nonincreasing in [0, +oo), so that its archimedean integral is well defined 
and (2.10) extends, by the remarks made at the end of Section 2.4.2, the 
integral elementarily defined on simple functions. If the integral is finite 
we say that <p is /i-inteprable. 

It follows directly from the analogous properties of the archimedean 
integral that the integral so defined is monotone, i.e. 



Indeed, (p > x// implies {cp > t} D {i/' - > t} and pi{{(p > t}) > /x({> 
t\) for all t > 0. Furthermore, the integral is invariant under modifica¬ 
tions of ip in /r-negligible sets, that is 



<p = /z-a.e. in X 


To show this fact it suffices to notice that cp = x// /x-a.e. in X implies that 
the sets {tp > t} and {i Jr > t] differ in a /x—negligible set for all I > 0, 
therefore pt({(p > ?}) = /x({r/r > t}) for all t > 0. 

Let us prove the following basic Markov inequality. 
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Proposition 2.9. For any a e (0, +oo) we have 

pi({ip > a}) < - [ (p(x)dfi(x). (2-11) 

a Jx 

Proof. For any a e (0, +oo) we have, recalling the inclusion {ip > a} C 
{ip > t) for any t e (0, a), that pi({ip > f}) > fi({<p > a }) for all 
t e (0, a). The monotonicity of the archimedean integral gives 

/» /»OO /»oo 

/ <p(x)dii{x)= / /r({^ > > / l (0a) (i)/x({(p > t})dr 

Jx Jo Jo 

> a/x({(p > a}). □ 

The Markov inequality has some important consequences. 

Proposition 2.10. Let ip : X [0, +oo] be an extended $-measurable 
function. 

(i) If cp is ii-integrable then the set {(p = +oo} has gL—measure 0, that 
is, (p is finite gi—a.e. in X. 

(ii) The integral of ip vanishes iff ip is equal to 0 fi-a.e. in X. 

Proof, (i) Since f x ip dpi < o owe deduce from (2.11) that 

lim fi({ip > a}) = 0. 

<2—>+OQ 


Since 

OO 

{(p = oo} = P|{<p > n], 

n= 1 

by applying the continuity along decreasing sequences in the space ({ip > 
1} (with finite n measure) we obtain 

pc({ip = oo}) = lim fi({<p > n}) = 0. 

n—>+oc 

(ii) If f x ip dpi = 0 we deduce from (2.11) that pi({(p > a}) = 0 for all 
a > 0. Since 


pi({ip > 0}) = 


lim pi({ip > -}) = 0, 

n- S-+OQ n 


the conclusion follows. The other implication follows by the invariance 
of the integral. □ 
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Proposition 2.11 (Monotone convergence). Let (<p n ) be a nondecreas¬ 
ing sequence of extended nonnegative S’—measurable functions and set 
(p(x) := lim (p n (x) for any x e X. Then 

n—> oo 


f 

Jo 


(p n (x) dll{x) \ 



(p(x) dp,{x). 


Proof. It suffices to notice that p.{{(p n > t}) t diW > t}) for all t > 0, 
and then to apply Proposition 2.8. □ 


Now, by Proposition 2.4 we obtain the following important approxim¬ 
ation property. 

Proposition 2.12. Let (p : X —»■ [0, +oo] be an extended $-measurable 
function. Then there exist simple S'—measurable functions (p n : X —> 
[0, +oo) such that <p n \ (p, so that 

poo poo 

/ ( p n (x)dp.(x ) f / (p(x) dp(x). 

Jo Jo 

Remark 2.13 (Construction of Lebesgue and Riemann integrals). 

Proposition 2.12 could be used as an alternative, and equivalent, defini¬ 
tion of the Lebesgue integral: we can just define it as the supremum of the 
integral of minorant simple functions. This alternative definition is closer 
to the definitions of Archimedean integrals and of inner Riemann integ¬ 
ral: the only (fundamental) difference is due to the choice of the family of 
“simple” functions. In all cases simple functions take finitely many val¬ 
ues, but within the Lebesgue theory their level sets belong to a or-algebra, 
and so the family of simple function is much richer, in comparison with 
the other theories. 

We can now prove the additivity property of the integral. 


Proposition 2.14. Letcp, Jr : X [0, oo] be S’ -measurable functions. 
Then 

/ {up + Jr) dp. = / <pdp.+ / Jr dp.. 

Jx Jx Jx 

Proof. Let <p n , \jr n be simple functions with <p n f < p and i jr n f Jr. Then, 
the additivity of the integral on simple functions gives 

/ (cp n + Jr n ) d p. = / (p n dp+ / Jr„dp. 

Jx Jx Jx 

We conclude passing to the limit as n —> oo and using the monotone 
convergence theorem. □ 


The following Fatou’s lemma, providing a semicontinuity property of 
the integral, is of basic importance. 
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Lemma 2.15 (Fatou). Let <p n : X —»■ [0, +oo] be extended ^-measur¬ 
able functions. Then we have 


L 


lim \ r\\'qj n (x) djiix) < liminf 

n—> oo n—> oo 


L 


<p n (x)dn(x). 


( 2 . 12 ) 


Proof. Setting ^>(jc) := lim inf,, <p„(x), and f„(x) = inf,„>„<p m (x), we 
have that ifr n (x) f ^>(x). Consequently, by the monotone convergence 
theorem, 


L 


<p(x)rf/x(x) = lim / \Jr„ (x)rf/x(x). 


L 


On the other hand 


/ d/i(x) < / cp n (x) d/J.(x), 

Jx Jx 


so that 


L 


iminf / 

»^oo J x 


(p(x)dfi(x) < liminf tp n (x)dn(x). 


□ 


In particular, if <p„ are pointwise converging to <p, we have 

/ <p(x) dpt{x) < liminf / <p„(x) dpt(x). 
Jx ll ^°° Jx 


2.5. Integral of functions with a variable sign 

Let <p: X —> M be an extended (^-measurable function. We say that tp 
is /i-integrable if both the positive part (p + (x) := max{^>(x), 0} and the 
negative part cp~(x) := max{— ip{x), 0} of <p are /z-integrable in X. As 
tp = (p + — <p~, in this case it is natural to define 


/ <p(x) d/z(x) := / (p + (x) dpi(x) — / <p (x)dpi(x). 

Jx Jx Jx 

As \(p\ = <p + + (p~, the additivity properties of the integral give that 


ip is /x-integrable if and only if / |<p| d/i < oc. 

Jx 


Let tp : X -*■ M and let A e A be such that t A <p is /x-integrable. We define 
also 


/ ip(x) d/i(x) := / l A (x)^(x) r//r(x). 

Ja Jx 

In the following proposition we summarize the main properties of the 
integral. 
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Proposition 2.16. Let ip, x/s : X —> M be pi—integrable functions. 

(i) For any a, ft e R we have that cap + fix// is pi—integrable and 

/ {dip + fxf) dpi = a / ipdpi + f / i J/dpi. 

Jx Jx Jx 


(ii) If ip < xf in X we have I ip dpi < I xjr dpi. 


(iii) 


/ ip dpi 

Jx 


L 


/ xpd\± < 

Jx Jx 


< / \ip\dfi. 


Proof, (i). Since (— ip) + = ip and {—ip) = ip + , we have f x —ip dpi = 
— f x ip dpi. So, possibly replacing ip by —ip and xf by — f we can assume 
that a > 0 and f > 0. We have 


{cap + fJxf) + + cap + fxp- = {oap + fixf) + cap + + /3xf + , 


so that we can integrate both sides and use the additivity on nonnegative 
functions to obtain 

/ {onp + fix/s) + dpt + a / ip~dpx + f / xf-dpi 

Jx Jx Jx 

= / {dip + fix/s)- dpi + a / ip + dpi + /3 / xf + dpi. 

Jx Jx Jx 


Rearranging terms we obtain (i). 

(ii) . It follows by the monotonicity of the integral on nonnegative func¬ 
tions and from the inequalities ip + < xf + and ip~ > x//~. 

(iii) . Since — \ip\ < ip < \<p\ the conclusion follows from (ii). □ 


Another consequence of the additivity property of the integral is the 
additivity of the real-valued map 

A £ S' i—^ / ip d pi 

J A 

whenever ip is /x-integrable. We will see in the next section that, as a 
consequence of the dominated convergence theorem, this map is even 
a -additive. 


2.6. Convergence of integrals 

In this section we study the problem of commuting limit and integral; 
we have already seen that this can be done in some particular cases, as 
when the functions are nonnegative and monotonically converge to their 
supremum, and now we investigate some more general cases, relevant for 
the applications. 
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Proposition 2.17 (Lebesgue dominated convergence theorem). Let 

(cp,, ) be a sequence of S’-measurable functions pointwise converging to 
i p. Assume that there exists a nonnegative n—integrable function f such 
that 

\<p n (x)| < x//(x) Vx e X, n e N. 

Then the functions (p„ and the function </? are p—integrable and 


lim 

n—> oo 


L 


<Pn dfl = 


L 


tp dp. 


Proof. Passing to the limit as n —> oo we obtain that i p is (^-measurable 
and \(p\ < i// in X. In particular (p is /x-integrable. Since <p + f is 
nonnegative, by the Fatou lemma we have 


/ (cp + VO dp < lim inf / (<p n + f ) dp,. 

Jx n ^°° Jx 


Consequently, 


/ (pdp < liminf / <p n dp. 

Jx Jx 


In a similar way we have 


/ — y?)< liminf / (i jr — (p n )dp. 

Jx n ^°° Jx 


(2.13) 


Consequently, 


/ (p dp > lim sup / cp„dp. 

Jx n —> oo Jx 


Now the conclusion follows by (2.13) and (2.14). 


(2.14) 

□ 


An important consequence of the dominated convergence theorem is 
the absolute continuity property of the integral of /x-integrable func¬ 
tions (p\ 


for any e > 0 there exists S > 0 such that /x(A) < 8 


L 


\(p\ dp < e. 
(2.15) 


The proof of this property is sketched in Exercise 2.9. 
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2.6.1. Uniform integrability and Vitali convergence theorem 

In this subsection we assume for simplicity that the measure \± is finite. 
A family of M-valued /x-integrable functions is said to be /z- 

uniformly integrable if 

lim / \w,(x) \ du(x) = 0, uniformly in / e I. 

h(A)^oJ a 

This means that for any s > 0 there exists 8 > 0 such that 

< 8 => / \(pi{x)\dp(x) < s Vi el. 

J A 

This property obviously extends from single functions to families of func¬ 
tions the absolute continuity property of the integral. 

Notice that any family {^,}, s / dominated by a single /z-integrable 
function <p (i.e. such that |<p,| < \ip\ for any i £ /) is obviously fi- 
uniformly integrable. Taking this remark into account, we are going to 
to prove the following extension of the dominated convergence theorem, 
known as Vitali Theorem. 


Theorem 2.18 (Vitali). Assume that /z is a finite measure and let (<p„) he 
a p-uniformly integrable sequence of functions pointwise converging to 
a real valued function (p. Then cp is p.-integrable and 

lim / <p n dp.= / (pdp. 

n—>oc J x J x 

To prove the Vitali theorem we need the following Egorov Lemma. 

Lemma 2.19 (Egorov). Assume that /x is a finite measure and let (cp,,) 
be a sequence of S’-measurable functions pointwise converging to a real 
valued function ip. Then for any 8 > 0 there exists a set A$ £ S' such that 
fi(Ag) < 8 and (p n (p uniformly in X \ As. 

Proof. For any integer m > 1 we write X as the increasing union of the 
sets B n _ m , where 


B n m •— \x £ X : | cpi(x) — cp(x )| < — V/ > n 

m 

Since /z is hnite there exists n(m) such that n(B n ( mj m ) > pAX) — 2~ m 8. 
We denote by As the union of X \ B n ( m) m , so that 


vA AA 8 

<: / j \ B n ( m ) m ) < y — 8. 


m= 1 


m=l 
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Now, given any e > 0, we can choose m > I /s to obtain that 

1 

I <Pn(x) - <p(x) | < — < e for all x e B n(m)jn , n > n(m). 
m 

As X \ A& c B n ( m y m , this proves the uniform convergence of (p„ to < p on 
X\As. ' □ 


Proof of the Vitali Theorem. Fix e > 0 and find <5 > 0 such that 
f A \(p n | dp < e whenever p(A) < 8. Again, Fatou’s Lemma yields that 
f A \<p\ d/ji < e whenever /x(A) < 8. 

Assume now that A is given by Egorov Lemma, so that cp n —> <p uni¬ 
formly on X \ A. Then, writing 


/ Up ~ (p n )dii = / ( 

Jx Jx\A 


ifp 


<Pn) d/J.+ (<P 

J A 


(Pn)dfl 


and using the fact that lim„ sup \ <p n — ip \ = 0 we obtain 

X\A 



(ip - tp n ) d/i 


< 3 £ 


for n large enough. The statement follows letting e | 0. 


□ 


2.7. A characterization of Riemann integrable functions 

The integrals fj f dk, with J = [a, b\ closed interval of the real line and 
A equal to the Lebesgue measure in M, are traditionally denoted with the 
classical notation j ^ f dx or with f j f dx. This is due to the fact that 
Riemann’s and Lebesgue’s integral coincide on the class of Riemann’s 
integrable functions. 

We denote by /*(/) and /*(/) the upper and lower Riemann integral 
of / respectively, the former defined by taking the supremum of the sums 
(h'+i — b) in correspondence of all step functions 

n— 1 

h = ^n/%,, r ,. +l) < / a = h < ■ ■ • < t n = b, (2.16) 

and the latter considering the infimum in correspondence of all step func¬ 
tions h > f. We denote by 1(f) the Riemann integral, equal to the upper 
and lower integral whenever the two integrals coincide. 

As the Lebesgue integral of the function h in (2.19) coincides with 
Y!'~ l ~ti), we have 

J g dk = I (g) for any step function g : J R. 
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Now, if / : J —► M is continuous, we can choose a uniformly bounded 
sequence of step functions g h converging pointwise to / (for instance 
splitting / into i equal intervals [v,, x,- +1 [ and setting a, = min| c; x;+l | /) 
whose Riemann integrals converge to 1(f). Therefore, passing to the 
limit in the identity above with g = g h , and using the dominated conver¬ 
gence theorem we get 


L 


f dX = I (/) for any continuous function / : J 


We are going to generalize this fact, providing a full characterization, 
within the Lebesgue theory, of Riemman’s integrable functions. 

Theorem 2.20. Let f : J = \a,b\ —> M be a bounded function. Then 
f is Riemann integrable if and only if the set of its discontinuity points 
is Lebesgue negligible. If this is the case, we have that f is 33 (J)im¬ 
measurable and 


Proof. Let 


l 


fdX=I(f). 


(2.17) 


f*(x) := inf ] liminf/(x;,) : jc* —> x 

h—> oo 


f*(x) := sup Ilimsup/Cr*) : x h x } . 

I h—>oo 


(2.18) 


It is not hard to show (see Exercise 2.6 and Exercise 2.7) that /* is lower 
semicontinuous and f* is upper semicontinuous, therefore both /* and 
f* are Borel functions. 

We are going to show that /*(/) = / y /* dk and /*(/) = fj f* dX. 
These two equalities yield the conclusion, as / is continuous at A.-a.e. 
point in J iff /*—/* = 0 A-a.e. in J, and this holds iff (because /*—/* > 
0) 


L 


(f*~f*)dX = 0. 


Furthermore, if the set of discontinuity points of / is A-negligible, the 
Borel function f* differs from / only in a A-negligible set, thus / is 
S3 (7); -mcasurable (because {/ > t} differs from the Borel set {/* > t] 
only in a /.-negligible set, see also Exercise 2.4) and its integral coincides 
with fj f* dX = fj /* dX\ this leads to (2.17). 
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Since /*(/) = —/*(—/) and f* = —(—/)*, we need only to prove the 
first of the two equalities, i.e. 


f*dk = /*(/) 


(2.19) 


j 


In order to check the inequality < in (2.19) we apply Exercise 2.11, find¬ 
ing a sequence of continuous functions gh \ f* < f and obtaining, 
thanks to the monotone convergence theorem, 

/ f*dX = sup / g h dX = sup I(g h ) = sup I*(g h ) < /*(/). 

Jj /ieN J J /isN /isN 

In order to prove > in (2.19) we fix a step function h < f in [a, b ) as in 
(2.16) and we notice that / > a, = h in (r,, t i+ \) implies /* > a,- in the 
same interval. Hence /* > h in J \ {t\, ..., t n } and, being the set of the 
tj ’s Lebesgue negligible, we have 



Since h is arbitrary the inequality is achieved. 


□ 


Exercises 

2.1 Show that any of the conditions listed below is equivalent to the ^'-mea¬ 
surability of (p : X -> R. 

(i) (p~ 1 ((—oo, f ]) c $ for all t e R; 

(ii) oo, ?)) c S for all teR; 

(iii) ^ _1 ([<7, b\) c £ for all a, b e R; 

(iv) b)) C £ for all a, b e 1; 

(v) cp~ l ((a, b)) c £ for all a, b e K. 

2.2 Let <p, \[r: X R be (^-measurable. Show that (p + i/r and ^i/r are £- 
measurable. Hint: prove that 


{(p + \// < t] — [J \{(p < r] n {i/r < t - r}] 



d(x, y) | arctanx — arctanyl 


where, by convention, arctan(ioo) = ±tt/2. 
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(i) Show that (R, d) is a compact metric space (the so-called compactification 
o/R) and that A C R is open relative to the Euclidean distance if, and only 
if, it is open relative to d ; 

(ii) use (i) to show that, given a measurable space (X, £), f : X -* R is £- 
measurable according to (2.2) if and only if it is measurable between § and 
the Borel a -algebra of (R, d). 

2.4 Let ( X , £, /i) be a measure space and let £ ^ be the completion of £ induced 
by //. Show that / : X —> R is d^-measurable iff there exists a (/-measurable 
function g such that {f ^ g} is contained in a //-negligible set of £. 

2.5 Let us define If as in (2.8) and let us endow E with the usual partial ordering 
a = {fi,..., t^} < £ = {si, ..., sm 1 if and only if er C f. Show that a h* 
I /(er) is nondecreasing. Use this fact to show that / (->• / 0 °° /(f) dt is additive. 

2.6 Let / : R —> R be a function. Show that the functions /*, /* defined in 
(2.18) are respectively lower semicontinuous and upper semicontinuous. 

2.7 Let / : R -> R be a bounded function. Using Exercise 2.6 show that 
{/* 5 f 1 and {/* > f} are closed for all t e R. In particular deduce that 

E = {x e R : /is continuous at x} 


belongs to J?(R). 

2.8 Let (a„) C (0, oo) with 

lim ai = 0. 
i—> oo 


oo 



i=0 


Show that for any <p : X —*► [0, +oo] ^-measurable there exist A/ e £ such that 
<p = E,' a i^A r Hint: set <P0 ■= V, := W > aol and <p { := <po - a 0 l Ao > 0. 
Then, set Ai := {cp\ > a \) and q>2 '.= (po — aii. A and so on. 

2.9 Let : X —»• R be //-integrable. Show that the property (2.15) holds. Hint: 
assume by contradiction its failure for some e > 0 and find A; with //(A,) < 2~‘ 
and f A , \(p\ dpi > e. Then, notice that B lim sup ( - A; is //-negligible, consider 

B n := |J Ai \ B \ 0 

i>n 


and apply the dominated convergence theorem. 

2.10 Prove that if <p n —► q> in L 1 (S2, £, //), then ( cp n ) is //-uniformly integrable. 
In addition, find a space (X, £, //) and a sequence (<p n ) that is //-uniformly 
integrable, for which there is no g e L l (X, £, //) satisfying \<p n \ < g for all 
n e N. 

2.11 Let (X, d) be a metric space and let g \ X —> [0, oo] be lower semicon¬ 
tinuous and not identically equal to oo. For any k > 0 define 
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Check that: 

(a) |gx(x) — gi(x')\ < Xd{x,x') for all x, x' e X\ 

(b) gx t 8 as * t °°- 

2.12 Let / : R 2 -» R be satisfying the following two properties: 

(i) x h* f(x, y ) is continuous in R for all yeK; 

(ii) y f(x,y) is continuous in R for all x e R. 

Show that / is a Borel function. Hint: first reduce to the case when / is 
bounded. Then, for e > 0 consider the functions 

i r x+e 

f e (x,y):=— f(x',y)dx', 

2e J x -s 

proving that f e are continuous and f e —> f as e \ 0. 


Chapter 3 

Spaces of integrable functions 


This chapter is devoted to the properties of the so-called L p spaces, the 
spaces of measurable functions whose p-th power is integrable. Through¬ 
out the chapter a measure space (X, S, /x) will be fixed. 


3.1. Spaces JS P (X, S, /x) and L P (X, S, /z) 


Let Y be a real vector space. We recall that a norm || • || on Y is a non¬ 
negative map defined on Y satisfying: 

(i) || y || = 0 if and only if y = 0; 

(ii) ||ay || = |a| ||y|| for all a e M and y e T; 

(iii) || yi + y 2 || < ||yi|| + ||y 2 || for all y u y 2 <= Y. 

The space Y, endowed with the norm || • ||, is called a normed space. 
Y is also a metric space when endowed with the distance d(y i, y 2 ) = 
||yi — y 2 || (the triangle inequality is a direct consequence of (iii)). If 
(T, d) is a complete metric space, we say that (7, || -1|) is a Banach space. 

We denote by Jz? 1 (X, S, /x) the real vector space of all /x-integrable 
functions on ( X , S). We define 




We have clearly 


\\cap\\\ = |a| H^lli Va e M, Vtp G £S x (X, S, /x), 


and 


11^ + flit < ll^lli + Ilf Hi V^), f e Jf x (X, S, /x), 

so that conditions (ii) and (iii) in the definition of the norm are fulfilled. 
However, || • || i is not a norm in general, since \\<p\\ \ = 0 if and only if 
cp = 0 /x-a.e. in X, so (i) fails. 
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Then, we can consider the following equivalence relation US on 
SS x (X, S, /x), 


<p ~ \js <=> (p — x// /x-a.e. in X (3.1) 

and denote by L l (X, §, fi ) the quotient space of S£ x {X, S , /x) with re¬ 
spect to SS. In other words, L 1 (X, S’, /x) is the quotient vector space 
of _§?■ 1 ( X , S, /x) with respect to the vector subspace made by functions 
vanishing /x-a.e. in X. 

For any ip e S/ A (X, S, /x) we denote by ip the equivalence class de¬ 
termined by (p and we set 

ip + ifr := (p + , aip := aip. (3.2) 

It is easily seen that these definitions do no depend on the choice of rep¬ 
resentatives in the equivalence class, and endow L x {X, S , /x) with the 
structure of a real vector space, whose origin is the equivalence class of 
functions vanishing /x-a.e. in X. Furthermore, setting 

ll^lli = IMIi. <p e L x (X,S, /x), 

it is also easy to see that this definition does not depend on the particular 
element (p chosen in tp, and that (ii), (iii) still hold. Now, if ||<p|| | = 0 
we have that the integral of \(p\ is zero, and therefore ip = 0. Therefore 
L l (X, S , /x), endowed with the norm || • ||i, is a normed space. 

To simplify the notation typically ip is identified with (p whenever the 
formula does not depend on the choice of the function in the equival¬ 
ence class: for instance, quantities as /x({<p > /}) or f x (p d/i have this 
independence, as well as most statements and results in Measure Theory 
and Probability, so this slight abuse of notation is justified. It should be 
noted, however, that formulas like <p(x) = 0, for some fixed x € X, do 
not make sense in L l (X, S, /x), since they depend on the representative 
chosen (unless /x({jc}) > 0). 

More generally, if an exponent p e (0, oo) is given, we can apply a 
similar construction to the space 

,X jp (X, S\ /i) : <p is (^-measurable and j \<p\ p d/i < oo 

Since |jc + y\ p < \x\ p + \y\ p if p < 1, and |x + y\ p < 2 p ~ l (\x\ p + 
\y\ p ) ii p > 1, it turns out that SS P (X, S, /x) is a vector space, and we 
shall denote by L P (X. S, pi) the quotient vector space, with respect to the 
equivalence relation (3.1). Still we can define the sum and product by a 
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real number as in (3.2), to obtain that L P (X, S, pc) has the structure of a 
real vector space. The case p = 2 is particularly relevant for the theory, 
as we will see. 

Sometimes we will omit either^ or pc, writing L P (X pi) or even L P (X). 
This typically happens when ( X, d ) is a metric space, and S is the Borel 
a -algebra, or when Xcl and pc is the Lebesgue measure. 

3.2. The L p norm 

For any cp e L p (X, S, pc) we define 



We are going to show that || • || p is a norm for any p e [ I. +oo). Notice 
that we already checked this fact when p = 1, and that the homogen¬ 
eity condition (ii) trivially holds, whatever the value of p is. Further¬ 
more, condition (i) holds precisely because L P {X, S, pc) consists, strictly 
speaking, of equivalence classes induced by (3.1). So, the only condi¬ 
tion that needs to be checked is the subadditivity condition (ii), and in the 
sequel we can assume p > 1. 

The concept of Legendre transform will be useful. Let / : M —> M be 
a function; we define its Legendre transform /* : M —> M U {+oo} by 

f*(y) = sup{xy - f(x)}, y e R. 

jcsR 

Then the following inequality clearly holds: 

*y < fix) + f*(y) Vi,ye M, (3.3) 

and actually f* could be equivalently defined as the smallest function 
with this property. 

Example 3.1. Let p > 1 and let 

if x > 0, 

if jc <0. 
we find that 

ify >0, 


/(*) = 


x p 

P 

0 


Then, by an elementary computation, 


roo = 


y_ 

q 


Too 


if y < 0, 
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where q = p/( p — 1) (equivalently, + 1 = 1). Consequently, the 
following inequality, known as Young inequality, holds: 

x p y q 

xy < — + —, x, v > 0. (3.4) 

P q 

Motivated by the previous example, we say that p and q are dual (or 
conjugate) exponents if - + - = 1, i.e. q = p/{p — 1). The duality 
relation is symmetric in (1, +oo), and obviously 2 is self-dual. 

Example 3.2. Let f(x) = e x , x £ M. Then 

+oo if y < 0, 

0 if y = 0, 

y log y — y if y > 0. 

Consequently, the following inequality holds: 

xy < e x + y log y — y, x, y > 0. (3.5) 


f*(y) := supjvy - e x } = 

xe R 


3.2.1. Holder and Minkowski inequalities 

Proposition 3.3 (Holder inequality). Assume thattp e L P (X, S\ /x) and 
\jj e L q (X, S’, /i), with p and q dual exponents in (1, +oo). Then 
(pxfr £ L l (X, S , pi) and 




(3.6) 


Proof. If either ||^|| p = 0 or \\iff\\ q = 0 then one of the two functions 
vanishes /x-a.e. in X, hence (pij/ vanishes /z-a.e. and the inequality is 
trivial. If both \\ip\\ p and || \j/ \\ q are strictly positive, by the 1-homogeneity 
of the both sides in (3.6) with respect to </? and \//, we can assume with no 
loss of generality that the two norms are equal to 1. 

Now we apply (3.4) to \(p{x)\ and \ ij/(x) \ to obtain 


I <p(x)1r(x)\ < 


\<P(x)\ p If (^)l g 

p q 


Integrating over X with respect to p yields 


f 1 1 

/ |<p(x)f (x)| dpL(x) <—b - = 1. 
Jx p q 


□ 


A particular case of the Holder inequality is 


/ (p(x)xls(x) dp,(x) 

Jx 


(/. 


< I / (p (x) dp(x) 
'x 


1/2 


(/. 


f 2 {x) dpi{x) 


1/2 


It also follows, as we shall see, from the Cauchy-Schwarz inequality of 
scalar products. 
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Proposition 3.4 (Minkowski inequality). Assume that p e [1,+oo) 
and i -p, iff e. L P (X, S, pi). Then (p + \[r £ L P (X, S, pi) and 

\\<P + 'l'\\ P <\\<p\\ P + W\\ P - (3.7) 


Proof. The cases p = 1 is obvious. Assume that p e (1, +oo). Then we 
have 

[ \<p + i/s\ p dpi < f \(p + x//\ p ~ l \(p\dpi + f \cp + f\ p ~ l \x//\dpi. 

Jx Jx Jx 

Since \(p + x//\ p ~ l € L q ( X, S, pi) where q = p/{p — 1), using the Holder 
inequality we find that 


[ \<p + x[r\ p dp. 


< 


(( 


i i i n 


\ 1/9 


and the conclusion follows. O 

By the previous proposition it follows that || • || p is a norm on L P (X. S, ii). 


3.3. Convergence in L P (X, S, /t) and completeness 

We have seen in the previous section that L P (X, S, pi) is a normed space 
for all p e [1, +oo). In this section we prove some properties of the con¬ 
vergence in these spaces, obtaining as a byproduct the following result. 

Theorem 3.5. L P (X, S, pi) is a Banach space for any p e [1, +oo). 

This theorem will be a direct consequence of the following proposition, 
that provides also a relation between convergence in L p and convergence 
pi- a.e. in X. 


Proposition 3.6. Let p e [1, +oo) and let (<p n ) be a Cauchy sequence in 
L p (X, S, pi). Then: 

(i) there exists a subsequence ((p n (k) ) converging pi-a.e. to a function q> 
in L p (X, S, pi); 

(ii) (cp n ) is converging to <p in L P (X, S , pi), so that L P (X, S , pi) is a 
Banach space. 

Proof. Let ( ip n ) be a Cauchy sequence in L P (X, S, pi). Choose a sub¬ 
sequence {ipn(kf) such that 

I \<P n(k+ 1) — <Pn(k)\\p < 2 k 


Wk e N. 
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Next, set 


g(x) ■- ^ | (p n( k+i){x) - (p n (k)(x)\, x e X. 
k =0 

By the monotone convergence theorem and the subadditivity of the L p 
norm it follows that 

g p (x)dpi(x = Jim 

N -1 

: lim 2~ k = 2 < oo. 

N-?oo 

k =0 




Therefore, g is hnite /r-a.e., that is, there exists B e S’ such that fk( B) = 
0 and g(x) < oo for all x e B c . Set now 

OO 

<p(x) := (Pn(0) (x) + y^((Pn(k+ l)(x) - <?„(£) (*)), X G B°. 
k=0 


The series above is absolutely convergent for any x e B c ] moreover, re¬ 
placing the series in the definition of cp by the finite sum 1 ((pnik+i) (x) 
— < p n (k)(x )) we obtain <p(x) = Iim* i p n (k)(x). Therefore, if we define (for 
instance) (p = 0 on the /x ncgligible set B, we obtain that <p n (k) <p 
/x-a.e. on X. 

The inequality \(p\ < \(p n ( 0 )\ + g gives that \(p\ p is /x-integrable, so that 
(p e L P (X, S’, n). So, (i) is proved. 

In order to prove (ii), we first claim that (p n (k) <P in L P (X, £, /x) as 
k —»■ oo. In fact, since 


OO 

\<P(X) - <p„(h)(x) | < ^ \(pn(k+l){x) - (p n (k){x)\, X G X, 
k=h 


we have, again by monotone convergence and subadditivity of the norm. 


(/, 


I <P(X) - (p n (h)(x)\ P d/l(x) 


i Ip 


< 


OO / n \ 1 Ip OO 

( / \Vn(k+\)(x) - (P, m {x)\ p dn(x)\ <^2~ k , 
k=h X ' k=h 


and the claim follows. 
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Since (<p„) is Cauchy, for any s > 0 there exists n, e N such that 

n, m > n E => || cp n - (p m \\ p < £• 

Now choose leN such that n(k) > n E and \\ip — (p n (k)\\ P < £■ For any 
n > n E we have 

II ^nllp — Wty Tn(k) II P T" ll^n(fc) 'Pnllp ^ 2fi. CH 

Remark 3.7 ( L p convergence versus /x-a.e. convergence). The argu¬ 
ment used in the previous proof applies also to converging sequences 
(as these sequences are obviously Cauchy), and proves that any sequence 
(<p n ) strongly converging to cp in L P (X,S, //) admits a subsequence (<p n (k)) 
converging /x-a.e. to <p: precisely, this happens whenever 

OO 

^ ' Il'Pntir+l) lip ■ 

0 

In general, however, convergence in L p does not imply convergence /x- 
a.e.: the functions 

<Po — V.i] 

<Pl = V.l/2]’ ^2 = 1[1/2,1] 

^3 = %),l/3]> ^4 = l[l/3,2/3]> <P5 = %/3,l] 


converge to 0 in L p { 0, 1), but are nowhere pointwise converging. 

The previous remark shows that we can expect to infer pointwise con¬ 
vergence from convergence in L p only modulo the extraction of a sub¬ 
sequence. Now, we ask ourselves about the converse implication: given 
a sequence (</>„) in L P (X, S, /x) pointwise converging to a function <p e 
L P (X, S, fi), we want to find conditions ensuring the convergence of 
{(p n ) to (p in L P (X, S, pi). This is not true in general, as the following 
example shows. 

Example 3.8. Let X = [0, 1], S = ^([0, 1]) and let pi = A be the 
Lebesgue measure. Set 


(Pn(x) = 


n if x e [0, 1/n], 

0 if * e [1/n, 1]. 


Then ip n (x) —»■ 0 for all x e (0, 1] but ||<p„||i = 1. 
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In the next proposition we assume that /x is a finite measure, since we 
defined /x-uniform integrability only for finite measures /x. 

Proposition 3.9. Let ( cp n ) be a sequence in L P (X, S, /x) poinlwise con¬ 
vergent to a function (p G L P (X, S, /x), with (|<p„| p ) /x— uniformly integ- 
rable. Then cp n —> cp in L P (X, S, /x). 

Proof. The functions h n := \<p n — tp\ p are pointwise converging to 0 and, 
because of the inequality 

hn <2 P ~\\(p n \ P + \cp\ p ), 

they are also easily seen to be uniformly /x-integrable. Therefore, by 
applying Yitali Theorem 2.18 to h n we obtain the conclusion. □ 

3.4. The space L°°(X, $, ft) 

Let i p: X —> 8 be a A-measurablc function. We say that <p is /x— 
essentially bounded if there exists a real number M > 0 such that 

/x({|«p| > AT}) = 0. 

If cp is /x-essentially bounded there exists a nonnegative number, denoted 
by MU such that 

Halloo = min {t > 0 : /i({\(p\ > f}) = 0}. (3.8) 

This easily follows from the fact that the function t —> ii({\<p\ > t }) is 
right continuous (Proposition 2.6), so the infimum is attained. 

Notice also that ||<p||oo is characterized by the property 

Moo < M W\ < M yU-a.e. in X. (3.9) 

We shall denote by L°°(X , S , /x) the space of all equivalence classes of 
/x-essentially bounded functions with respect to the equivalence relation 
~ in (3.1), thus identifying functions that coincide /x-a.e. in X. 

Several properties of the L p spaces extend up to the case p = oo: first 
of all L°°(X, S, /x) is a real vector space and we have the Minkowski 
inequality 

\\<P + Vico < Halloo + Halloo- (3.10) 

Indeed, by (3.9) and the triangle inequality, | <p(x) + f(x)\ < ||^||oo + 
H^Hoo /x-a.e. in X, therefore (3.8) provides (3.10). As a consequence, 
L°°(X, S , /x) endowed with the norm || • H^, is a normed space. 
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The Holder inequality takes the form 

f \<P'l'\d/i < IMloo f \fi\dpi. (3.11) 

Jx Jx 

Indeed, we have just to notice that \ip(x)fi(x)\ < ||<p||ooll^C*)I for /r-a.e. 
x € X, and then integrate with respect to /x. This inequality can be still 
written as (3.6), provided we agree that q — 1 is the dual exponent of 
p — o o (and conversely). 

For finite measures we can apply Holder’s inequality to obtain that the 
L p spaces are nested; in particular L°° is the smaller one and L 1 is the 
larger one. 

Remark 3.10 (Inclusions between L p spaces). Assume that /x is finite. 

Then, ifl<r<s<oowe have 


L r (X, g, ix) D L S {X, g, pi). 


In fact, if r < s and <p e L'(X, g, n) we have, in view of the Holder 
inequality (with p = s/r and q = s/(s — r)). 


I, 


U 


\<p(x)\’ dp(x) < / \cp{x)\ s dfi(x) 


r/s 


a. 


t x dfi(x) 


l—r/s 


and so 

Mr < (/x(X))^/-||^|| s . (3.12) 

By (3.12) we obtain that p i-* pl{X)~ 1 I P \\(p\\ p is nondecreasing for <p in 
the intersection of the spaces L P (X , S\ /x), so that it has a limit as p 
oo. Since fx(X)~ l / p —>• 1 as p —>■ oo we obtain that lim^oo \\(p\\ p exists, 
finite or infinite. The following proposition characterizes L°°(X, g, /x) 
and the L°° norm in terms of this limit. 


Proposition 3.11. Assume that /x is finite and let ip be in the intersection 

Pi L p (x,g,ii). 

p<o o 

Then ip e L°°(X, g, pi) if and only if the limit lim^oo ||^|| p is finite. If 
this is the case, we have that ||^||oo coincides with the value of the limit. 

Proof. If p > 1 we have by the Markov inequality 

d({\(p\ > a}) = p,({\(p\ p > a p }) < a~ p \\ip\\ p . 

Consequently, ||<p|| p > ap>({\ip\ > a }) l ^ p , which yields lim p ||^|| p > 
a whenever pi({(p > a}) >0. So, if the limit is finite, we have ip e 
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L°°(X, S, pi) and ||^||oo < lim p \\(p\\ p . The converse inequality follows 
directly from (3.11); the same inequality also proves that if the limit is 
not finite, then ip ^ L°°(X, S, pi). □ 


In the next remark we characterize the convergence in L°°, proving also 
that L°°(X, S, pi) is a Banach space: as a matter of fact, convergence 
in L°°(X, S, pi) differs from the convergence in supremum norm only 
because a /z-negligible set is neglected. 

Remark 3.12 ( L°°(X , S. //,) is a Banach space). Assume that (<p n ) C 
L°°(X, S, pi) is a Cauchy sequence, and let us consider the //-negligible 
set 

OO 

[J {x e X : | (p„(x) - (p m (x) \ > || ip n - cp m ||oo) • 

n, m =0 

Then sup fic | <p n — ip m \ < \\(p n — ip m H^; as a consequence, the complete¬ 
ness of the space of bounded functions defined in B c provides a bounded 
function ip : B c —► M such that cp n <p> uniformly in B c . Extending cp 
in an arbitrary S --measurable way (for instance with the 0 value) to the 
whole of X, we get ip„ ip m L°°(X, S, pi). 

A similar argument proves that <p n ip in L°°(X, S, pi) if and only if 
there exists a /x-negligible set B e S satisfying < p n —> ip uniformly in 
B c . 

We know that | f x ip dpi\ does not exceed f x \ip\ dpi. A nice and useful 
generalization of this fact is the so-called Jensen inequality. 

Recall that, if / c M is an interval, a continuous function g : J —>■ M 
is said to be convex if 


g 


< 




(3.13) 


By several approximations (see Exercise 3.7) one can prove that a convex 
function / satisfies g(tx + (\—t)y) < tg(x) + (l — t)g(y) forallx, y € J 
and t e [0, 1], and even that 


( n \ n n 

Y, t t x; J < tig(xi) whenever t, >0, x t e J and 'Y t, = 1. 

i = l / i=l i= 1 

(3.14) 

In the proof we use an elementary property of convex functions g : M —»■ 
M satisfying g{t) —> +oo as |t| —> +oo, namely the existence of a 
minimum point / 0 ; moreover, the function g is nondecreasing in | r 0 , +oo) 
and nonincreasing in (—oo, / 0 1 (see Exercise 3.8). 




55 Introduction to Measure Theory and Integration 


Proposition 3.13 (Jensen). Assume that p is a probability measure. Let 
g: M — ► M be convex and bounded from below and let tp e L 1 (X, S, gt). 
Then we have 

g(^J (pdp'j < J g{<p)dp. (3.15) 

Proof. Let us first show (3.15) when (p is simple. Let 

n 

i=i 

where n > 1 is an integer, a\,... ,a n el and A\, ..., A n are mutually 
disjoint sets in S whose union is X, so that 

n 

^/x(A,) = 1. 

/=! 


Then, from (3.14) we infer 


g 




otiP-(Ai) 


n 

< Y2g(oti)ix(Ai) 

/=! 


L 


g{tp) dp.. 


In the general case, let us first assume that g(t) — ¥ +oo as |/1 —y +oo. 
Then, by Exercise 3.8 we know that g has a minimum point t 0 , and that 
g is nondecreasing in [? 0 , +oo), and nonincreasing in (— oo, t 0 |. We can 
assume with no loss of generality (possibly replacing g(t) by g(t — t 0 ) 
and (p by tp + to) that g attains its minimum value at to = 0, and that 
fx g^P) dp ' s finite. Furthermore, replacing g by g—g(0), we can assume 
that the minimum value of g is 0. 

Let cpf be nonnegative simple functions satisfying <pf j" (p ^; the simple 
functions <pf — <pf converge to cp + — tp~ = q> in L 1 (X, S, p). In addition, 
since g is monotone in (—oo, 0] and [0, +oo), the monotone convergence 
theorem gives 

/ g(<pf)dp\ / g(<p + )dp, / g{-tp~)dp t / g(—<p~) dfi, 

Jx Jx Jx J 

so that (since g(0) = 0, <p+cpf = 0 and <p + <p~ = 0) f x g(<p+ - <p~) dp = 
f x g(<pf) dp + f x g(~(pf ) converges to f x g((p+)dp + f x g(~<P~) = 
fx g(fP) dp. Passing to the limit as n —> oo in Jensen’s inequality for the 
simple functions <pf — <pf 


g 



(fPt ~ <Pn ) dp 


< / g(<Pt ~<Pn)dp 
Jx 


we get (3.15). 
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Finally, the assumption that g{t ) +oo as t +oo can be removed 
by considering the functions g E (t) := g(t) + e\t\, which converge to +oo 
as |/1 —> oo, thanks to the fact that g is bounded from below: we obtain 


CMH/« 


cpdjJL 


< 


/ g((p)dfi + s 

Jx Jx 


\cp\dfi. 


and Jensen’s inequality follows by letting e l 0. □ 

An alternative proof of Jensen’s inequality is based on another viewpoint, 
namely the representation of g as the supremum of a family {L, }, e / of 
affine functions. Since /x is a probability measure, for all ; e / it is easy 
to check that L, (f (pdfi) = f L, (up) dp,, so that 


w (J (pdp,) < 


L 


L((p ) dji V/ e /. 


Taking the supremum in the right hand side we obtain Jensen’s inequality. 
Both viewpoints are important in the theory of convex functions. 

To be more precise, Jensen’s inequality holds provided g is convex on 
an interval containing the image of (p. The next example is very important 
in Probability and Information theory. 

Example 3.14 (Entropy functional). By applying Jensen’s inequality 
with the convex function g(z) = z lnz in [0, +oo) we obtain 


/ (p\rupdii> / (pd/ilnl / cpd/i 
Jx Jx \Jx 


(3.16) 


for all (p e L'(X,(o,ii) nonnegative. If f x (pd/i = 1 we obtain that 
f x (p\n(pd[i >0 even though the function g has a variable sign (it attains 
the minimum value — l/e at z = 1/e). 


3.5. Dense subsets of L P (X, //) 

Proposition 3.15. For any p e [1, +oo], the space of all simple p- 
integrable functions is dense in L P (X, S’, pf. 

Proof. Let / e L P (X, S, /i) with / > 0. Then the conclusion follows 
from Proposition 2.12 (by Proposition 2.4 in the case p = oo) and the 
dominated convergence theorem. In the general case we write / as f + — 
f~ and approximate in L p both parts by simple functions. □ 

We consider now the special situation when X is a metric space, S is 
the cr-algebra of all Borel subsets of X and /x is any finite measure on 
(X, S). 

We denote by Cb(X) the space of all continuous bounded functions on 
X. Clearly, C h (X) C L P (X, S, /x) for all p e [1, +oo]. 
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Proposition 3.16. For any pe[ 1, +oo) and any finite measure fi, C/,(X) 
is dense in U'(X, S, /x). 


Proof. Let ‘S be the closure of C/,(X ) in L P (X, S, /x); obviously 2? is a 
vector space, as Cb(X) is a vector space. In view of Proposition 3.15 it is 
enough to show that for any Borel set I e S3 (X) there exists a sequence 
(tp n ) C C b (X) such that tp n —> lj in L P (X, S, /x). 

Assume hrst that / is closed. Set 


<Pn U) = 


1 — n d(x, I) if d(x, 1) < ] - 
0 if d(x, /) > 


where 

d(x, I) := inf{|x — y| : ye I}. 

It is easy to see that cp n are continuous, that 0 < <p„ < 1 and that tp n (x ) —> 
t,(x), hence the dominated convergence theorem implies that (p„ —> 1/ in 
L p (X, S, /x). 

Now, let 

& :={I e 3d(X) : I; e 

It is easy to see that <£ is a Dynkin system (which includes the xr-system 
of closed sets), so that by the Dynkin theorem we have = SS ( X ). □ 

Remark 3.17. Cb(X) (or more precisely, the equivalence classes of con¬ 
tinuous bounded functions) is a closed subspace of L°°(X, S, /x), and 
therefore it is not dense in general. Indeed, if (c p n ) C C/,(X) is Cauchy 
in L°°(X, S, /x), then it uniformly converges, up to a //-negligible set 
B (just take in Remark 3.12 as B the union of the /x-negligible sets 
{\(p n —(p m \ > \\(p n — <Pm\\})- Therefore {(p n ) uniformly converges on B c and 
on its closure K. Denoting by <p> e Cb(K) its uniform limit, by Tietze’s 
exension theorem we may extend <p> to a function, that we still denote by 
i -p, in Cb(X). As X \ K c B is /x-negligible, it follows that (p„ <p in 
L°°(X, S, /x). 

Exercises 

3.1 Assume that /x is a-finite, but not finite. Provide examples showing that no 
inclusion holds between the spaces L P (X, S, /x) in general. Nevertheless, show 
that for any ^-measurable function (p : X —> R. the set 

{p e [l,oo] : cpe L p (X,S,pi)} 

is an interval. Hint: consider for instance the Lebesgue measure on R. 
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3.2 Let 1 < p < q < oo and / e L q (X , S, p ,). Show that for any S e (0, 1) 
we can write f = g + f, with g e L q (X, S, p), / e L P (X, S, p) and ||g|| 9 < 
S || /|| q (notice that if p is finite we can take g = 0). 

3.3 Let p e (1, oo), q> e L p and i jr e L q , with q = p' , be such that ||^t/t||i = 
\\(p ||p || i/f || q . Show that either i/r = 0 or there exists a constant X e [0, +oo) such 
that |^)| = X\i//\ q ~ l /x-a.e. in X. Hint: first investigate the case of equality in 
Young’s inequality. 

3.4 Prove the following variant of Holder’s inequality, known as Young’s in¬ 
equality: if q> e L p , \[r e L q and -f ^ with r > 1, we have that <p\jr e L' 
and || <pf\\ r < \\(p\\ p \W\\ q . 

3.5 Let (cp n ) C L l (X, S, pt) be nonnegative and satisfying liminf„ (p n > <p p,- 
a.e. in X. Show that 



Hint: notice that the positive part and the negative part of <p — <p n have the same 
integral to obtain 



Then, apply the dominated convergence theorem. 

3.6 Show that the following extension of Fatou’s lemma: if (p n > — i/r„, with 
t/t„ e L l (X) nonnegative, i/f„ —i/f in L l (X), then 



Hint: prove first the statement under the additional assumption that i jr n —*■ x/r 
/x-a.e. in X. 

3.7 Show that (3.13) implies g(tx + (1 — t)y) < g{x) + (1 — t)g{y) for all 
x, y e J and t e [0, 1], Then, deduce from this property (3.14). Hint: it is 
useful to consider dyadic numbers t = k/2 m , with k < 2'" integer. 

3.8 Let g : R —»• K be a convex function such that g(z) — > +oo as |z| —> +oo. 
Show the existence of jo £ R where g attains its minimum value. Then, show 
that g is nondecreasing in [zo, +oo) and nonincreasing in (—oo, zoL 

3.9 Let ((p n ) C L 1 (X, S , /i) be nonnegative functions. Show that the conditions 




imply the convergence of < p n to <p in L 1 (X, S, n). Hint: use Exercise 3.5. 

3.10 Let {(Pi}i e i be a family of functions satisfying 
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and assume that <J>(c)/c is nondecreasing and tends to +oo as c —> +oo. Show 
that \<pi }, e / is fi -uniformly integrable. Hint: use the inequalities 


/ WildII < 

Ja J a 


^n{|p,-|>c} 


^d, + f 

'I'(c) J AC\{\<pi\<c) 


Wi\dn < 


M 

>h(c) 


+ cn(A), 


with 'h (c) := $ (c)/c, and then choose c sufficiently large, such that M/ '\> (c) < 
ell. 

3.11 ★ Assuming that (X, d) is a metric space, S — SS (X) and /i is finite, prove 
Lusin’s theorem: for any e > 0 and any / e /J (A, S, /x), there exists a closed 
set C C X such that fi{X \ C) < e and f\c is continuous and bounded. Hint: 
use the density of Cb(X ) in L 1 and Egorov’s theorem. 




Chapter 4 
Hilbert spaces 


In this chapter we recall the basic facts regarding real vector spaces en¬ 
dowed with a scalar product. We introduce the concept of Hilbert space 
and show that, even for the infinite-dimensional ones, continuous linear 
functionals are induced by the scalar product. Moreover, we see that even 
in some classes of infinite dimensional spaces (the so-called separable 
ones) there exists a well-defined notion of basis (the so-called complete 
orthonormal systems), obtained replacing finite sums with converging 
series. Even though the presentation will be self-contained, we assume 
that the reader has already some familiarity with these concepts (basis, 
scalar product, representation of linear functionals) in finite-dimensional 
spaces. 

4.1. Scalar products, pre-Hilbert and Hilbert spaces 

A real pre-Hilbert space is a real vector space H endowed with a map¬ 
ping 

H x H —»■ M, (x, y) —»■ (x, y), 

called scalar product, such that: 

(i) (x, x) > 0 for all x e H and (x, x) = 0 if and only if x = 0; 

(ii) (x, y) = ( y , x) for all x, y € H\ 

(iii) (ax + fiy, z) — a(x, z) + f)(y, z) for all x, y, z e H and a, /3 e M. 

In the following H represents a real pre-Hilbert space. 

The scalar product allows us to introduce the concept of orthogonality. 
We say that two elements x and y of H are orthogonal if (x, y) = 0. 

We are going to prove that the function 

11x|| := yj (x,x), x e H 

is a norm in H. For this we need the following Cauchy-Schwartz in¬ 
equality. 
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Proposition 4.1. For any x, y e FI we have 

l(*,y)l<IWI lb'll- (4.i) 

In (4.1) equality holds if and only ifx and y are linearly dependent. 

Proof. Set 

F(k) = \\x + Ay|| 2 = A 2 ||y|| 2 + 2A(x, y) + ||x|| 2 , A e M. 
Since F{ A) > 0 for all A e M we have 

l(x,y>| 2 -||x|| 2 ||y|| 2 < 0, 


which yields (4.1). 

If x and y are linearly dependent, it is clear that \(x, y)\ = ||x|| ||y||. 
Assume conversely that (x,y) = ±||x|| ||y|| and that y 0. Then 
we have F( A) = (||x|| ± A||y||) 2 so that, choosing A = ^||x||/||y||, we 
find F( A) = 0. This implies x + Ay = 0, so that x and y are linearly 
dependent. □ 

Now we can prove easily that || - || is a norm in H. In fact, it is clear 
that 11ax || = |a|||x|| for all a e M and all x e H. Moreover, taking into 
account (4.1), we have for all x, y e H , 

IIx + y11 2 = (x + y,x + y> = ||x|| 2 + ||y|| 2 + 2(x, y) 

< ll^ll 2 + Ibll 2 + 2||x|| ||y|| = (||x|| + ||y||) 2 , 
so that ||x + y || < ||x|| + ||y||. 

Therefore a pre-Hilbert space H is a normed space and, in particular, 
a metric space. If H , endowed with the distance induced by the norm, is 
complete we say that H is a Hilbert space. 

Example 4.2. (i). M" is a Hilbert space with the canonical scalar product 

n 

(x, y) := ^2 x kyk, 

k= 1 

inducing the Euclidean distance, where x = (xi, ..., x„), y = (yi, ... 

• • •, y n ) e M”. 

(ii). Let (A, S', p.) be a measure space. Then L 2 (X, S’, pi), endowed with 
the scalar product 

(<p,i]f) \= / ip{x)f(x) dpi(x) ip, f e L 2 (X, S', p), 

Jx 

is a Hilbert space (completeness follows from Proposition 3.5). 
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(iii) . Let l 2 be the space of all sequences of real numbers x = (x k ) such 

OO 

that Y2 x k < 00 • is a vector space with the usual operations, 

k=0 

a(x k ) = ( ax k ) ael, (x k ) + (y k ) = (x k + y k ), (x k ), (y k ) e i 2 . 
The space l 2 , endowed with the scalar product 

OO 

(x, y) := ^xkyk, x = (x k ), y = (yk) e V 2 
k =0 

is a Hilbert space. This follows from (ii) taking X = N, S’ = & (X) and 
/x({x}) = 1 for all x e X. 

(iv) . Let X = C([0, 1]) be the linear space of all real continuous func¬ 
tions on [0, 1], X is a pre-Hilbert space with the scalar product 

(/> g) ■= f f(t)g(t)dt. 

Jx 

However, X is not a Hilbert space: indeed, X is dense, but strictly con¬ 
tained, in L 2 (0, 1). 

Finite-dimensional pre-Hilbert spaces H are always Hilbert spaces: 
indeed, if {iq,..., v n }, with n = dim H, is a basis of H, the Gram- 
Schmidt orthonormalization process (recalled in Exercise 4.3) provides 
an orthonormal basis { e\ . e n } of H (i.e. ||e, || = 1 and e ; - is ortho¬ 

gonal to cj for i ^ j), and the map 

n 

x = ^(x, ei)ei ^ ({x, ei), {x, e 2 ), ...,{x, e„)) 

i =1 

(mapping x to the Euclidean vector of its coordinates with respect to this 
basis) is easily seen to provide an isometry with M": indeed, 

n n n 

|| ^(x, e,)c/1| 2 = ^ (x, ej){x, ej){ei, ej) = ^((x,c,-)) 2 . 

i=i 1 1=1 

Thus, being M" complete, H is complete. 

4.2. The projection theorem 

It is useful to notice that for any x, y € H the following parallelogram 
identity holds: 

IIx + v|| 2 + ||x — v|| 2 = 2||.r|| 2 + 2||y|| 2 , x, y e H. (4.2) 

One can show that identity (4.2) characterizes pre-Hilbert spaces among 
normed spaces, and Hilbert among Banach spaces, see Exercise 4.1. 
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Theorem 4.3 (Projection on closed subspaces). Let H be a Hilbert 
space and let Y be a closed subspace of H. Then for any x £ H there 
exists a unique y £ Y, called projection of x on Y and denoted by jty(x), 
such that 

\\x - y II = min ||x - z||. 

zeY 

Moreover, y is characterized by the property 

{x — y, z) = 0 for all z £ Y. (4.3) 

Proof. Set d := inf^y ||x — z|| and choose (y n ) C Y such that ||x — y n || 
d. We are going to show that (y„) is a Cauchy sequence. 

For any m, n e N we have, by the parallelogram identity (4.2), 

II (x -y n ) + (x - y m ) ||■ 2 +1| (x - y n ) - (x - y m ) || 2 = 21|x - y n || 2 +2||x - y m || 2 . 


Consequently 


\\Yn — ym\\ 2 = 2||x — y„|| 2 + 2||x — y m || 2 — 4 



y» + y,„ 
2 


2 


Taking into account that (y„ + y m )/2 £ Y we find 

\\y* - ym II 2 < 2||X - yj 2 + 2||x - y m II 2 - 4c? 2 , 


so that || y„ — y m \\ —> 0 as n, m —> oo. Thus, (y„) is a Cauchy sequence 
and, since the space is complete and Y is closed, it is convergent to an 
element y e Y. Since ||x — y„|| —> ||x — y|| we find that ||x — y|| = d. 
Existence is thus proved. Uniqueness follows again by the parallelogram 
identity, that gives 


ILv - /II 2 < 2||x - y|| 2 + 2||x - y'H 2 - 4 



y + y' 
2 


< 2d 2 + 2d 2 - Ad 2 = 0 


whenever y and / are minimizers. 

Let us prove (4.3). Define 

F{X) = ||x — y — Az|| 2 — A 2 ||z|| 2 - 2A(x - y, z) + ||x - y|| 2 , A £ M. 

Since F attains a minimum at A = 0, we have F'( 0) = (x — y,z) = 0, 
as claimed. 

Conversely, if (4.3) holds for all z £ F, we have 

llx - y - z|| 2 = ||z|| 2 + ||x - y || 2 > ||x - y|| 2 . □ 
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Remark 4.4 (Projection on convex closed sets). The previous proof 
works, with absolutely no modification, to show that for any convex 
closed set K c H and any x e H there exists a unique solution y = 
ttk(x) to the problem 

min \\x — z\\. 

zeK 

In this case, however, ttk(x) is not characterized by (4.3), but by a one¬ 
sided condition, namely (x — jtk(x), z — tzk(x)) < 0 for all z e K, see 
Exercise 4.2. 

Corollary 4.5. Let Y be a closed proper subspace of H. Then there ex¬ 
ists xq £ H \ {0} such that (xq, y) = Ofor all y e Y. 

Proof. It is enough to choose an element zo in H which does not belong 
to Y and set x 0 = zo - tt y (z o)- □ 


Fix an integer n > 1, a /;-dimensional subspace H n c H and an 
orthonormal basis {e\, ..., e n } of it. The following result characterizes 
the projection on H „, giving the best approximation of an element .r by a 
linear combination of {ei, ..., e,,}. 

Proposition 4.6. The projection of an element x e H on H n is given by 

n 

Xh,M) = e ^ ek - 

k=\ 


Proof. We have to show that for any yi, ■ •., y n G M we have 


n 

2 

n 

X - ^2 X k e k 
k= 1 

< 

V - T: y k e k 

k= 1 


where Xk = (x, ef). We have in fact 

n ^ n n 


x ykSk 


+ Y! yk ~ 2 ^2 x k y k 


k= 1 


k =1 k =1 


(4.4) 


= Ikll 2 - Yh X k + _ yk)1 - 

k= 1 k= 1 

This quantity is clearly minimal when x k = y k , and 

2 

12 


X -'Yl Xk6k 

k= 1 


= -V 


~I2 X l 


(4.5) 


k= 1 


An alternative proof of the Proposition, based on the characterization 
(4.3) of n h„ (v), is proposed in Exercise 4.4. □ 
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4.3. Linear continuous functionals 

A linear functional F on H is a mapping F : H —> M such that 

F(ax + fy) = aF(x ) + fF(y) Vx, y e H, Va, f £ R. 

F is said to be bounded if there exists K > 0 such that 
|F(x)| < K\\x\\ for all x & H. 

Proposition 4.7. A linear functional F is con tinuous if and only if, it is 
bounded. 

Proof. It is obvious that if F is bounded then it is continuous (even 
Lipschitz continuous). Assume conversely that F is continuous and, by 
contradiction, that it is not bounded. Then for any n e N there exists 
x n e H such that \F(x n )\ > « 2 ||x„||. Setting y n = \x n /\\x n \\ we have 
||y„|| = I —>• 0, whereas F(y n ) > n, which is a contradiction. □ 

The following basic Riesz theorem, gives an intrinsic representation 
formula of all linear continuous functionals. 

Proposition 4.8. Let F be a linear continuous functional on H. Then 
there exists a unique Xq G H such that 

F(x) — (x,x o) Vx e FI. (4.6) 

Proof. Assume that F / 0 and let Y = F~ { (0) = Ker F. Then Y f H is 
closed (because F is continuous) and a vector space (because F is linear), 
so that by Corollary 4.5 there exists Zo £ H such that F(zo) = 1 and 

{zo, z) = 0 for all ze Ker F. 

On the other hand, for any x e H the element z = x — F(x)z o belongs 
to KerF since F(z) = F(x) — F(x)F(z,o) = 0. Therefore 

(z 0 , x - F(x)zo) = 0 for all x e H , 


so that 

(x, zo) - F(x)||zol| 2 = 0 

and (4.6) follows setting x 0 = Zo/llzoII 2 - 

It remains to prove the uniqueness. Let y 0 e H be such that 


F(x) = (x, xo) = (x, yo)> x e H. 

Then, choosing x = xo — yo we hnd that 11 xo — To 11 2 = 0, so that 
x 0 = yo. □ 
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4.4. Bessel inequality, Parseval identity and orthonormal sys¬ 
tems 

Let us discuss the concept of basis in a Hilbert space H, assuming with 
no loss of generality that the dimension of H is not finite. We use Kro- 
necker’s notation 8 kk , equal to 1 for h = k and equal to 0 if h = k. 

Definition 4.9 (Orthonormal system). A sequence (e*)*€N C H is call¬ 
ed an orthonormal system if 


(e h ,e k ) = 8, hk , h,ke N. 


Proposition 4.10. Let ( e k ) ke pj be an orthonormal system in H. 

(i) For any x G H we have 

OO 

^l(*,e*>| 2 < Ml 2 - (4.7) 

k=0 

OO 

(ii) For any x G th the series ff (-L e k )e k is convergent in H 11 

k =0 

(iii) Equality holds in (4.7) holds if and only if 

OO 

x - y^fx, e k )e k . (4.8) 

k =0 


Inequality (4.7) is called Bessel inequality and when the equality holds, 
Parseval identity. 

Proof, (i) Let n e N. Then by (4.5) we have 


e k )e k 


k =o 


= \\x\\ 2 -J2\(x,e k )\ 2 


(4.9) 


k =0 


so that (4.7) follows by the arbitrariness of n. 
(ii) Let n , p e N and set 


s n = e k )e k . 

k =0 


A series x i vectors in a Banach space E is said to be convergent if the sequence of the 
k =0 
n 

finite sums X( is convergent in E 
k =0 
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Then 


II £«+/;> S n \\ — 


n+p 

Y {x,e k )e k 

k=n +1 


n+p 

= Y l< x ’ e dl 2 

k=n +1 


oo 

Since the series | (x, e k )\ 2 is convergent by (i), the sequence (s „) is 

4=o 

Cauchy and the conclusion follows. 

Passing to the limit as n —> oo in (4.9) we find 


OO 

* - Y^ x ’ ek ' )ek 

4=0 


2 


IMI 2 -£l<^4>l 2 . 

4=0 


This proves statement (iii). 


□ 


Definition 4.11 (Complete orthonormal system). An orthonormal sys¬ 
tem (e k ) ke f$ is called complete if 

OO 

x = Y {x , e k )e k Vx G H. 

4=0 

Example 4.12. Let H = £ 2 as in Example 4.2(iii). Then, it is easy to see 
that the system ( e k ), where 

e k ;= (0, 0, ..., 0, 1, 0, 0,...) (with the digit 1 in the &-th position) 

is complete. Indeed, if x — (x k ) e i 2 we have that (x, e{) = x, (the /-th 
component of the sequence x), so that 

n oo 

||x - J>, e^Cif = Y x l °- 

k =0 k=n +1 

We already noticed that M" is the canonical model of n -dimensional Hil¬ 
bert spaces H, because any choice of an orthonormal basis {iq, ..., v „} 
of H induces the linear isometry 


n 

a \—r Y^ Cl i e i 

i=i 

from M" to H (which, as a consequence, preserves also the scalar product, 
by the parallelogram identity). For similar reasons, i 2 is the canonical 
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model of all spaces H having a complete orthonormal system in 

this case, the linear map from l 2 to H given by 

OO 

a m>- afgf 

i '=0 

is an isometry, thanks to Parseval’s identity. 

Proposition 4.13 (Completeness criterion). Let (e„) be an orthonormal 
system. Then (e„ ) is complete if and only if the vector space E spanned 
by (e n ) is dense in H. 

Proof. If (e n ) is complete we have that any x € H is the limit of the 
finite sums (x, e^e,, which all belong to E, therefore E is dense. 
Conversely, if E is dense, for any x e H and any e > 0 we can find a 
vector z = a i e i with Ik ~ x II < e. By applying Proposition 4.6 
twice (first to the vector space spanned by {e\, , e m }, and then to the 

vector space spanned by {ei, ..., e „}) we get 

m n n 

ik - - h* _ ~'Yh aie ^ < s 

i=i i=i i=i 

for m > n. Since e is arbitrary this proves that the sum of the series is 
equal to v. □ 

The following proposition provides a necessary and sufficient condi¬ 
tion for the existence of a complete orthonormal system. We recall that 
a metric space ( X, d) is said to be separable if there exists a countable 
dense subset Del 

Theorem 4.14. A Hilbert space H admits a complete orthonormal sys¬ 
tem (cy-k-sN if and only if H, as a metric space, is separable. 

Proof. If H admits a complete orthonormal system (ekiteN then H is sep¬ 
arable, because the collection of finite sums with rational coefficients 
of the vectors ek provides a countable dense subset (indeed, the closure 
of contains the finite linear combinations of the vectors ry and then the 
whole space). 

Conversely, assume that H is separable and let (v„) be a dense se¬ 
quence. We define <? 0 = Vo, e \ = ry., where k\ is the first k > ko = 0 
such that Vk is linearly independent from V( h c j 2 = Vk 2 where kj is the 
first k > k[ such that 14 , is linearly independent from {e 0 , e \}, and so on. 
In this way we have built a sequence (c, ) of linearly independent vectors 
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generating the same vector space generated by (v„). Let S be this vec¬ 
tor space, and let us represent it as U„ S „, where S n is the vector space 
generated by {eo, ■ ■ ■, e n }. Notice that S is dense, as all v n belong to S. 

By applying the Gram-Schmidt process to e ,, an operation that does not 
change the vector spaces S n generated by the vectors e 0 > ■ ■ ■, e„, we can 
also assume that (e,-) is an orthonormal system. Then, Proposition 4.13 
gives that (e,) is complete. □ 

4.5. Hilbert spaces on C 

In this section we illustrate briefly how the concepts introduced so far 
extend to complex vector spaces H. A pre-Hilbert space is a complex 
vector space H endowed with a mapping 

H x H ^ C, (x,y) -* (x, y), 

called scalar product, such that: 

(i) (jc, x) > 0 for all x £ H and (x, x) = 0 if and only if x = 0; 

(ii) (jc, y) = (y, x) for all x, y £ H\ 

(iii) {ax + /3y, z) = a{x, z) + ft{y, z > for allx, y, z e H and a, /3 £ C. 

It turns out that ||jc|| := *J{x, x) is still a norm, because the Cauchy- 
Schwarz inequality still holds. Hence, we can define Hilbert spaces as 
those spaces for which the norm induces a complete distance. 

The canonical model of ^-dimensional Hilbert space is C". Given a 
measure space (X, &, /z), a basic example of Hilbert space is the space 
of ^"-measurable and square integrable functions / : X —> C. In this 
context ^-measurable means that both the real and the imaginary part of 
/ are ^-measurable. In this space one can define the scalar product 

<7 8 ) '■= f f(x)g(x)dp(x) 

Jx 

and prove that it induces an Hilbert space structure. The space -^OC) of 
complex-values sequences (z„) with (\z n \) £ IT®) is a particular case. 

The norm still satisfies the parallelogram identity, so that we can still 
prove the existence of orthogonal projections on closed subspaces and its 
characterization in terms of 

Re((x - tt y (x), z)) = 0 Vz £ Y. 

Analogously, in Remark 4.4, one has to replace the scalar product by its 
real part. 
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Riesz representation theorem still holds (now for continuous and C-linear 
functionals) and the concepts of orthonormal system and complete or¬ 
thonormal system make sense. We have Bessel’s inequality for orthonor¬ 
mal systems and Parseval’s identity for complete orthonormal systems. 
Finally, f 2 (C) is the canonical model of all separable Hilbert spaces; as 
in the real case the correspondence is induced by the choice of a complete 
orthonormal system, which provides coordinates of a vector. 

We conclude this chapter providing a natural example, considered in 
the literature, of non-separable Hilbert space. 

Example 4.15 (Quasi-periodic functions). We define the space AP(¥l) 
of almost periodic functions as the closure, with respect to uniform con¬ 
vergence in R, of the vector space generated by complex-valued periodic 
functions (of arbitrary period). This space has been extensively studied 
by Bochner and Bohr. It is easy to show that the space of almost periodic 
functions is not only a vector space (it is a subspace of C(R, C)), but also 
an algebra, i.e. fg e AP(M.) whenever f. g e T P (R). 

If / is almost periodic one can also show (by approximation, taking 
into account that this property is linear with respect to / and holds for 
periodic functions) that there exists the limit 

1 f T 

M(f) := hm — f(x + t)dt. 

T —»-+oo 2.1 J—T 

In addition, it is easily seen that the limit is independent of x. 

The space AP(R) of all almost periodic functions is a pre-Hilbert 
space when endowed with the following inner product 

(f,g) A p:=M(fg) f,geAP(R). 

For any leM define 


exit) = e ,Xt , t e R. 

Then ex e AP(R), [ex, ex )ap = 1 and 

e iT(k-v) _ e -iT(\-v) 

(ex, e v ) AP = lim -——---= 0 whenever A. / v, 

7^+00 Ti(X — v) 

so that (<?a)/.cR is an orthonormal system in AP(M) having the cardinality 
of continuum. One can also characterize the (abstract) Hilbert completion 
of AP(R) (the so-called Bohr almost periodic functions) and prove that 
the system \e x )x^R is complete. For more details see e.g. [4]. 
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Exercises 

4.1 Let ( X , || ■ ||) be a normed space, and assume that the norm satisfies the 
parallelogram identity (4.2). Set 

{x, y) := ~\\x + yf - -||* - y\\ , x, y e X. 

Show that •} is a scalar product whose induced norm is || • ||. Use this identity 
to show that any linear isometry between pre-Hilbert spaces preserves also the 
scalar product. 

4.2 Show that, in the situation considered in Remark 4.4, ttk (x) is characterized 
by the property 

{x — tik(x), z — jtk(x)} <0 Wz e K. 


4.3 Let H be a finite dimensional pre-Hilbert space and let v n }, with 

n = dim H, be a basis of it. Define 


fi=vi, f 2 = v 2 - 


(V2, fl) 


fl, h = V3 ~ 


(V3, fl) 


fl - 


(^3> fl) 


fl, 


{fl> fl) (fl,fl) {fit fl) 

Show that e ; = ///II./i'll is an orthonormal system in H (notice that Vk — fk is 
the projection of Vk on the vector space generated by {i>i,..., Vk -1 })■ 

4.4 Let H be a Hilbert space, and let X be an infinite-dimensional separable 
subspace. Show that 


OO 

Ttxix) = J2 {X , ek)ek Vx e H, 
k =o 


where (e*) is any complete orthonormal system of X. Hint: show that the vector 
x — { x ’ e k) e k is orthogonal to all vectors of X. 

4.5 Let X be the space of functions / : [0, 1] R such that / (x) / 0 for at 
most countably many x, and f 2 (x) < +oo. Show that X, endowed with the 
scalar product 

{ftg)'-= f( x )8 ( x ). 

jce[0,1] 

is a non-separable Hilbert space. 

4.6 Let (ek)kc_N be a complete orthonormal system of H. Show that, for any 
x, y e H we have 

OO 

^{x, ek){y, e k ) = {x, y). (4.10) 

k =0 

4.7 ★ Show that for any Hilbert space H there exists a family (not necessarily 
finite or countable) of vectors {e, }, € / such that: 

(i) (e,-, ef) is equal to 1 if i = j, and to 0 otherwise; 

(ii) for any vector x e H there exists a countable set J C I with 

x = ^(x, e/>e/. 
ieJ 


Hint: use Zorn’s lemma. 






Chapter 5 
Fourier series 


In this chapter we study the problem of representing a given T -periodic 
function as a supeiposition, for a suitable choice of the coefficients, of 
more “elementary” ones. This problem was first studied by J. Fourier 
in the case when the elementary functions are the trigonometric ones 
(nowadays we know that many different choices are indeed possible). 
Thanks to the theory of L 2 spaces and of Hilbert spaces developed in the 
previous chapters, the problem can be formalized by looking for com¬ 
plete orthonormal systems in L 2 made by trigonometric functions. 

We shall mostly be concerned with the case of 2n -periodic functions, 
but a simple change of scale (see Remark 5.1) easily provides the trans¬ 
lation of the results to arbitrary periods. 

We are concerned with the measure space ((— n, it), n)), k), 

where k is the Lebesgue measure. As usual, we shall write for brev¬ 
ity 7 r). We shall denote by (-, •) the canonical scalar product 

given by 

71 

{f,g)-= J f{x)g(x)dk = J f(x)g(x)dx, f, g e L 2 (-jt, jt). 

r) — n 

Let us consider, as a family of elementary functions, the trigonometric 
system , given by: 

1 1 1 

_ ; —= coskx , k e N, k > 1; —= sin kx, k e N, k > 1. 

V27T v 71 ’ s/tt 

(5.1) 

It is easy to check with integration by parts that this is an orthonormal sys¬ 
tem in L 2 (—tt, 7 r), see Exercise 5.1. Thus, in view of Proposition 4.10, 
the series of functions 

1 OO 

S(x) = - «o + 'y^S a k cos kx + b k sin kx), (5.2) 

2 it=i 


L. Ambrosio et al., Introduction to Measure Theory and Integration 
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is convergent in L 2 (— tt, tt) for any / e L 2 {—tt, tt), where 

1 r n 

a k := — / f{y) coskydy, k e N, 

* J-n 


and 


i r 

bk := — / f{y) sinkydy, k e N, k > 1. 

J —7T 


Notice that a 0 /2 is the mean value of / on (— n, tt), in agreement with 
the fact that all terms in the series (5.2) have mean value 0 on {—n, n). 
To recognize (5.2) in terms of scalar products, we see that the term a 0 /2 
corresponds to 

/, —L=\ —— 

spin I Pin 

and the terms cik cos kx, b k sin kx for k > 1, correspond respectively to 


1 


u 


/, —— cos kxj —= cos kx, (/, —j= sin kx ) —j= sin kx. 


1 


1 


Formula (5.2) is called the trigonometric Fourier series of /. 

The Bessel inequality (4.7) reads, in this context, as follows: 

1 /» 7T 1 OO 

- / \f(x)\ 2 dx > - al + ^{al + b\). (5.3) 

77 J-x 1 k= t 

Indeed, it is easily seen that o^tt/2 = ((/, Xjspln)) 2 and, for > 1, 


apn = I (/, —— cos 


h 7 7T = I (/, —— sin^rx 


First, we shall find sufficient conditions on / ensuring the pointwise con¬ 
vergence of the series S(x) to f{x) in {—n, tt). Then, we shall show 
that the trigonometric system is complete, so that the inequality above 
is actually an equality. As shown in Exercise 5.4 and Exercise 5.5, the 
trigonometric system, the trigonometric series and the form of the coef¬ 
ficients become much more nice and symmetric in the complex-valued 
Hilbert space L 2 ({—tt, tt); C): 


/(x) = J2 a ne inx 
ne Z 


1 c n 

where a n := — / f{x)e " ,x dx. 
2 TT J- n 
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Remark 5.1 (2 T -periodic functions). If / e L 2 (—T, T) we can write 
instead 


OO 

CIq —> 71 71 

fix) = — + 2_^a k cos —kx + b k sin —kx 


k= 1 


with 


a k ■- 


i r 

j: J f(x)dx 


if k = 0; 


1 f 1 

— / f(x)cos—kxdx if k > 0, 

T J_ T Jy ’ T 


7X 


:=I r 

TU 


7T 


b k := — / f(x) sin — kx dx. 
l-T T 


5.1. Pointwise convergence of the Fourier series 

For any integer iV > 1 we consider the partial sum 

1 A 

Sn(x) := - a 0 + y^( a kCOskx + b k sinfcx), x e [—n, jt). 

2 k= l 

Since the functions cos kx and sin kx are 27r-periodic, it is natural to 
extend / to the whole of M as a 27r-periodic function /, setting 

f(x + 2nn) = f(x), x e [—jt, it), n — ± 1 , ±2 . ( 5 . 4 ) 

We shall denote in the sequel by H kr (z) the “Heaviside” function 


H,Az) := 


7 if z < 0; 

r if z > 0. 


Lemma 5.2. For any integer N > 1 and x, l, r e 1R we have 


Sn(x) - 


l+r 1 


f 

J —1 


fix + r) - Hi r i r) . T f l\ 1 

, - sin L/V + - t dr. 

2jt J_ n sin(r/2) L V 2/ J 

(5.5) 
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Proof. Write 


N 


Sn(x) = - ciq + cos kx + bk sin kx) 


k= 1 


\ C i A 

— — I f(y) —h ^ (cos kx cos fcy + sin kx sin ky ) 

^ ./-7T ^ . 


i r 


f(y) 


k= 1 

1 " 

- + ^cosk(x - y) 
2 £=i 




dy. 


To evaluate the sum, we notice that for any z e 




+ ^ cos &z 


£=1 


sin 


GO 


= \ t sin G0 + I]( sin [G + DO - sii # - §)*)]] 


k= 1 


= - sin[(iV + \)z\. 


Therefore 


| + f C0Sfe= >!hIfTtiM 

7 ^ 2 sin(h) 


k=\ 


and so, 


sin[(N + |)(x-y)] 


Now, setting r = y — x we get 


1 f n x ~ sin \(N + 5 ) rl 

Sn(x) = — / f{x + r) ---—--dr 


sin G T ) 


1 Z 171 ~ sin \(N + rl 

= ln\ f{x + T) ' dT 


sin (^r) 


(5.6) 


Sn(x) = ^~ f f(y) L ) " J dy. (5.7) 

2 ?r sin(i(x-y)J 


since the function under the integral is 2 tt- periodic. Now, integrating 
(5.6) over [— 7 r, 7 r] yields 
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so that 


-f 

tt Jo 


sin[(A^ + i)r] 


sinQr) 


dr = 1 = — 

n J-n 


1 f° 

x J-n 


sin [((V + |) r] 


sin (|r) 


dr. 


If we multiply both sides by / and r, and subtract the resulting identities 
from (5.7), (5.5) follows. □ 


Proposition 5.3 (Dini’s test). Let x, l, r 6 M be such that 
I fix + r) - H, A r)l 


f 


■ dr < oo. 


I sin(r/2)| 

Then the Fourier series of f converges to (/ + r)/2 at x. 


(5.8) 


Dini’s test shows a remarkable property of the Fourier series: while the 
specific value of the coefficients a k and b k depends on the behaviour of / 
on the whole interval (— it, it), and the same holds for the Fourier series, 
the character of the series (convergent or not) at a given point x depends 
only on the behaviour of / in the neighbourhood of x : indeed, it is this be¬ 
haviour that influences the integrability of (/(x + r) — H\ r (r))/ sin(r/2) 
(the only singularity being at r = 0 ). 

In the next example we provide sufficient conditions for the conver¬ 
gence of the Fourier series. 

Example 5.4. Assume that / : [— jt, tv] —► K is L-Lipschitz continuous, 
i.e. 

I/M - /Ml <L\X- y | v X, y e [—tt, tv] 

for some L > 0. Then Dini's test is fulfilled at any x £ M \ Z 7 r choosing 
1 = r = f{x), and at any x £ Z 7 r choosing / = /(x_) and r — /(x + ) (1) . 
Indeed, with these choices of / and r , the quotient 

fix + f) ~ HiA r) 

sin(r/ 2 ) 

is bounded in a neighbourhood of 0 . 

The same conclusions hold when / is a-Holder continuous for some 
a £ ( 0 , 1 ], i.e. 


I fix) - /(y)| < L\x - y\ a , V x, y £ [-it, it ] 

for some L > 0: in this case the quotient is bounded from above, near 0, 
by the function L|r|“/| sin(r/2)| ~ 2L|r |“ _1 which is integrable. 


11 * here we denote by g(x~). g(x+) the left and right limits of g at x 
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More generally, the argument of the previous example can be used to 
show that the Fourier series is pointwise convergent for piecewise C 1 
functions /: at continuity points x the series converges to fix), and at 
(jump) discontinuity points x it converges to (/(x_) + /(*+))/2. How¬ 
ever, the mere continuity of / is not sufficient to ensure pointwise con¬ 
vergence of the Fourier series. 

In order to prove Proposition 5.3, we need the following Riemann- 
Lebesgue lemma, a tool interesting in itself. 

Lemma 5.5. Let (e k ) be an orthonormal system in L 2 (—tt , it). Assume 
that there exists M > 0 such that || e* || oo 5 M for all k e N. Then for 
any f e L*(— n, tt) we have 


r 

lim / f(x)e k (x)dx = 0. 

k^oo J_ n 


(5.9) 


Proof. Notice first that if / e L 2 {—tt, n) the conclusion of the lemma is 
trivial. We have in fact in this case 



and, since by Bessel’s inequality the series |(/, e k )\ 2 is convergent, 
we have Yim k (f, e k ) = 0. 

Let us now consider the general case. We know that bounded continu¬ 
ous functions are dense in tt), hence for any e > 0 we can find 

g € Cb(—n, tt) such that ||/ — g|| i < e. As a consequence 


\(f, e k )\ = \{f - g, e k )\ + \{g,e k )\ < Ms + Kg', e*>| 


and letting k —»■ oo we obtain limsup^. \(f,e k )\ < Ms. Since s is arbit¬ 


rary the proof is achieved. 

Proof of Proposition 5.3. Set 


□ 


_ f{x + r) - H,. r (x) 
sin(r/2) 


e L 1 {—tt, tt). (5.10) 


Then, writing 


1 


1 


1 



and applying the Riemann-Lebesgue lemma to g cos t /2 (with = 
sin Nt) and to g sin(f/2) (with e n = cos Nt) we obtain from (5.5) that 
Sn(x) converge to (/ + r)/ 2. □ 
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5.2. Completeness of the trigonometric system 

Proposition 5.6. The trigonometric system (5.1) is complete. In particu¬ 
lar equality holds in (5.3) and 


lim 

N^-OO 



— Sn /Ml 2 dx = 0 


V/ e L 2 (-tt, it). (5.11) 


Proof. We show that the vector space E generated by the trigonometric 
system is dense in L 2 (—tt, tt). Let H' be the closure, in the L 2 (—tt, it) 
norm, of E, that is easily seen to be still a vector space as well. We will 
prove in a series of steps that H' contains larger and larger classes of 
functions. 

Let / : [—it, tt] —> [0, +oo) be a Lipschitz function, and let us prove 
that it belongs to H'. Indeed, we know from Example 5.4 that S N - / 
pointwise in (—tt, tt). On the other hand, we already know from Propos¬ 
ition 4.10(ii) that the Fourier series is convergent in L 2 (—tt, n) to some 
function g (which is indeed, by Exercise 4.4, the orthogonal projection of 
/ on H'), therefore a subsequence (S^/fk)) is converging /.-almost every¬ 
where to g. It follows that g = f and Sn —>■ f in L 2 (—n, n). 

If now g : [—71, 7 T] —> [0, +oo) is continuous, we know that g can be 
monotonically approximated by the Lipschitz functions 

gx(x) ■= min (g(y) + X\x - y|), x e [-tt, tt] 

ys[—7i,7i] 


(see Exercise 2.11), that converge to g also in L 2 (—tt, tt) by the dom¬ 
inated convergence theorem. As a consequence also g belongs to H'. 
Since H' is invariant by addition of constants, we proved that all continu¬ 
ous functions in [— 7 T, 7 r] belong to H'. We conclude using the density of 
this class of functions in L 2 (—tt, tt). □ 

Remark 5.7. Let / e L 2 (—tt, tt). Then, the Parseval identity reads as 
follows 

J /*7T OO 

- / \f(x)\ 2 dx = - al + ^(a* + b\). (5.12) 

71 1 k =l 

For instance, taking f(x) = x one hnds the following nice relation 
between tt and the harmonic series with exponent 2 : 


00 


E 


1 

k 2 


TT 


2 


6 ' 


Notice that (5.11) provides, for any / e L 2 (—tt,tt), the existence of 
a subsequence N(k) such that Suffix) —> f(x) for Jzf 1 -a.e. * e 
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(— 7 r, tt). Is it true that the whole sequence S^f converges a.e. to /? 
This problem, surprisingly difficult, has been solved by L.Carleson only 
in 1966, see [1]. 

Finally, we notice that there exist other important examples of com¬ 
plete orthonormal systems, besides the trigonometric one. Some of them 
are illustrated in the exercises. 

5.3. Uniform convergence of the Fourier series 

We conclude by studying the uniform convergence of the Fourier series. 
We recall that a series x n in a Banach space E is said to be totally 
convergent if the numerical series Y^ \\ x n I is convergent. Using the 
completeness of E it is not difficult to check (see Exercise 5.2) that any 
totally convergent series is convergent (as we have seen in the previous 
chapter, this means that the finite sums Yo x « converge in E to a vector, 
denoted by Yo* x n)- 

Now we show that the Fourier series of C 1 functions / with f(—n) = 
/( n) are uniformly convergent: the proof highlights two important prin¬ 
ciples, whose validity extend to higher order derivatives (see Exercise 
5.9) and to Fourier transforms: first, the Fourier coefficients of the deriv¬ 
ative of a function are linked to the Fourier coefficients of the function; 
second, higher regularity of / implies a faster decay of the Fourier coef¬ 
ficients, and therefore a convergence in stronger norms of the Fourier 
series. 

Proposition 5.8. Assume that f e C*([— jt, tt]) and that f(—n) = 
f [tt ). Then the Fourier series of f converges uniformly to f in [—tt, tt ]. 

Proof. We first notice that / in (5.4) is Fipschitz continuous, so that by 
Proposition 5.3 we have 

2 CO 

f{x) = - fl 0 + ^(a A , cos kx + bk sin Ur) Vv e [— tt , tt ]. 

2 k=\ 

Fet us consider the Fourier series of the derivative f of /, 

CO 

y>[. cos kx + b' k sin Ur) x e [—n, tt ], 

k= 1 

where, for k > 1 integer, 

a' k =- [ f(y)coskydy, b' k = - / f'{y) sin kydy. (5.13) 

It J—Jt tt J —71 
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Notice that a' 0 = 0 because /(— tt) = /{tt) implies that the mean value 
of f on {—tt, tt) is 0. As easily checked through an integration by parts 
(using again the fact that /{—tt) = f {n)), we have a' k = kb k and If, = 
—ka k . Then, by the Bessel inequality it follows that 

OO OO -J^ pjt 

J2k 2 {a 2 k +b 2 k ) = ^(fll.) 2 + (^.) 2 < - / \f'{x)\ 2 dx < oo. (5.14) 

Jt=l k= 1 71 ■'“ 7r 


Therefore the Fourier series of / is totally convergent in C([— tt, tt]) and 
therefore uniformly convergent. We have indeed 


OO 

> max | a k cos kx + b k sin kx \ 


< 


+ i^i) 


k= 1 
/ oo 


1/2 


1/2 


< ( ^k 2 {\a k \ + \b k \) 2 ) l^k 2 ) < °°- 


U-i 


a=i 


O 


Exercises 

5.1 Check that the trigonometric system (5.1) is orthogonal. 

5.2 Let £ be a Banach space. Show that any totally convergent series /f n x n , 
with {x n ) C E, is convergent. Moreover, 



(5.15) 


Hint: estimate || ffo x n — /Zo x n\\ w hh the triangle inequality. 

5.3 Prove that the following systems on L 2 ( 0, tt) are orthonormal and complete 



sin kx, 


k> 1, 


and 


1 



cos kx, 


k > 1. 


5.4 Show that 

e k {x) := —^=e lkx , k e Z 
■Jhr 

is a complete orthonormal system in Lr {{—tt, tt); C). Hint: in order to show 
completeness, consider first the cases where / is real-valued or if is real-valued. 
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5.5 Let (ek) be as in Exercise 5.4. Using the Parseval identity show that 

2 


§(/ 


f(x)e lkx dx 


V/ e L z ((—7r, Jr); C). 


5.6 Let / e L 2 {{—tc, n); C) and let Sn f = e k) e k, with N > 1, be the 

Fourier sums corresponding to the complete orthonormal system in Exercise 5.4. 
Show that 


f(x) - S N f(x ) = [ G n (x - y)(f (x) - f(y))dy 
J —71 


with 


G n {z) := 


sin ((N + l/2)z) 
sin(z/2) 


Hint: use the identities J2o e ‘ ky = ( e ‘ y ) k = {e l ^ N+r>z — \)/{e ,y — 1). 

5.7 Arguing as in Remark 5.7, show that k~ 4 — tt 4 /90. Hint: consider the 
function f(x) = x 2 . 

5.8 Chebyschev polynomials C„ in L 2 (a, b), with (a, b) bounded interval, are 
the ones obtained by applying the Gram-Schmidt procedure to the vectors l,x, 
x 2 , x 3 , .... They are also called Legendre polynomials when ( a , b) — (— 1, 1). 


(a) Compute explicitly the first three Legendre polynomials. 

(b) Show that (C'„)„sN is a complete orthonormal system. Hint use the density 
of polynomials in C([a, b]). 

(c) ★ Show that the n-th Legendre polynomial P n is given by 


Pn(x) 


In ■ 1 1 d n 

V 2 2Hi\~cHx 


(x 2 - 1)". 


5.9 Let / e C m {[—n, 7r]; C) with f^(—Tt) = for all j = 0,, m — 

1. Show that 4 m) , the k-th Fourier coefficient of f <m> is linked to c>, the k-th 
Fourier coefficient of /, by c£ m) = ( ik) m ck■ 





Chapter 6 

Operations on measures 


In this chapter we collect many useful tools in Analysis and Probability 
that will be widely used in the following chapters. We will study the 
product of measures (both finite and countable), the product of measures 
by L l functions, the Radon-Nikodym theorem, the convergence of meas¬ 
ures on the real line K and the Fourier transform. 

6.1. The product measure and Fubini-Tonelli theorem 

Let (A, &) and (Y, be measurable spaces. Let us consider the product 
space A x Y. A set of the form Ax B, where and B e , is called 

a measurable rectangle. We denote by 38 the family of all measurable 
rectangles. 38 is obviously a n -system. The a -algebra generated by 38 
is called the product a-algebra of & and . It is denoted by .T- x ( /L 
Given er-finite measures /z in (A, 3P) and v in (T, (#), we are going to 
define the product measure /i x y in (A x Y. .A x ( 3). 

First, for any E e 38 x Sf we define the sections of E, setting for 
x e X and y e Y, 

E x := !y e Y : (x, y ) e E], E? := {x e X : (x, y) e E}. 

Proposition 6.1. Assume that /z and v are a -finite and let E e 33 x . 
Then the following statements hold. 

(i) E x e 3? for all x e X and E y e 3P for all y £ Y. 

(ii) The functions 


x m>. v(E x ), y i > p(E y ), 


are 3P-measurable and < S—measurable respectively. Moreover, 



( 6 . 1 ) 


L. Ambrosio et al., Introduction to Measure Theory and Integration 
© Scuola Normale Superiore Pisa 2011 
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Proof. Assume first that £ = A x B is a measurable rectangle. Then, if 
(jt, y) e X x Y we have 

J B if jc e A pv (A if yeB 

x (0 ifi^A, [0 if yfB. 

Consequently, 

v(E x ) = 1 a (x)v(B), pfE y ) = 1 B (y)n(A), 

so that ( 6 . 1 ) clearly holds. 

Now, let 3> be the family of all £ e x such that (i) is fulfilled. 
Clearly, ^ is a Dynkin system including the n -system M. Therefore, (i) 
follows from the Dynkin theorem. 

Now, if both /x are v are finite, let .(0 be the family of all £ e & x 
<£ such that (ii) is fulhlled. Clearly, Qt is a Dynkin system including 
the 7 r-system Stf (stability under complement follows by the identities 
v((E c ) x ) = v(Y) - v(E x ) and /x((£ c F) = p,(X) - /x(£ v )). Therefore, 
(ii) follows from the Dynkin theorem as well. 

In the general cr-finite case we argue by approximation: if E e x , 
& 9 X h t X and B Y h \ Y satisfy p.{X h ) < oo and v(Y h ) < oo, we 
define the finite measures 


Eh(A) = Ai(A n X h ), v h (B) = v(B n Yh) 


to obtain that x m>- v/,(£ y ) is ^"-measurable and y Hh(E x ) is Im¬ 
measurable for all £ e S’ x . Passing to the limit as h —> oo in the 
identity 



v h (E x )d/i(x) = 


L 

L 


v h (E x )dn h (x) 
Eh (£' v ) dv(y) 


L 


Eh(E y )dv h (y) 


the continuity properties of measures and integrals give (6.1) as well. □ 


Theorem 6.2 (Product measure). If n and v are a -finite, there exists a 
unique measure X in (X x Y, ffi x ) satisfying 

A .(A x B) = /x(A)y(£) for all A e JF, Be <S. 

The measure X is a -finite and denoted by p x v. Furthermore /x x v 
is finite (resp. a probability measure) if both /x and v are finite (resp. 
probability measures). 
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Proof. Existence is easy: we set 

A .(E)— l v(E x ) dp(x) = I pc(E y )dv(y), EeJFxl#. (6.2) 

Jx Jy 

Using the continuity and additivity properties of the integral, it is immedi¬ 
ate to check that A is a measure on (X x Y, fP x ( S). In the case of a -finite 
measures, uniqueness follows by the the coincidence criterion for posit¬ 
ive measures stated in Proposition 1.15: indeed, the value of the product 
measure is uniquely determined on the n -system JE made by rectangles 
Ax B with n(A) and v (B) finite, and thanks to the cr-finiteness assump¬ 
tion there exist E n = A n x B n e JE with E n \ X x Y. □ 


Corollary 6.3. Let E e J 5 " x <£ be such that /z x v(E) = 0. Then 
fi(E y ) = 0 for v—almost all y e Y and v( E x ) = 0 for fi—almost all 
x e X. 


Proof. It follows directly from (6.2). □ 

We consider here the measure space (X x Y, TP x A), where A = 
fi x v and p. and v are o -finite. 

Theorem 6.4 (Fubini-Tonelli). Let E : X x Y — »■ [0, +oo] he a , f X x r ,d- 
measurable map. Then the following statements hold. 

(i) For any x e X (respectively y e Y), the function y F (x , y) (re¬ 
spectively x m>- F(x,y)) is &-measurable (resp. fE-measurable). 

(ii) The functions 

jc i—> / F(x,y)dv(y), y i—► / F(x, y) dp(x) 

Jy Jx 

are respectively X-measurable and measurable. 

(iii) We have 


f F(x, y)dk(x, y) = [ [ F(x,y)dv(y) 

JxxY JX Uy 

= f [ F(x,y)d[i(x) 
Jy \_Jx 


dp(x) 


dv(y). 


(6.3) 


Proof. Assume first that F = l E , with E e .(E xTT . Then we have 


F(x,y) =l Ex (y), x e X, F(x,y)(x) = l Ey (x), y e Y, 

so (i), (ii) and (iii) follow from Proposition 6.1. Consequently, by lin¬ 
earity, (i)-(iii) hold when F is a simple function. If F is general, it 
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is enough to approximate it by a monotonically increasing sequence of 
simple functions and then pass to the limit using the monotone conver¬ 
gence theorem. □ 

Remark 6.5 (The definition of integral revisited). We noticed in Re¬ 
mark 2.13 that the integral of nonnegative functions can also be defined 
without using the archimedean integral, by considering minorant simple 
functions. If we follow this approach, the identity that we used to define 
the integral can be derived by applying the Fubini-Tonelli theorem to the 
subgraph 

E := {(v, t) e X xR : 0 < t < f(x )}, 

with the product measure /x x X, A being the Lebesgue measure. Indeed, 
it is not difficult to show that E is & x SS (R)-measurable whenever / 
is ^"-measurable, so that 


n OO n OO n 

/ /x({/ > t}) dt = / l^(E') dt = /x x X{E) = / X(E x )dp,(x) 

Jo Jo Jx 


-L 


= / f{x)dn{x). 


Of course, splitting F in positive and negative parts, also the case of 
extended real valued maps can be considered: 

Corollary 6.6. Let E : X x Y —»■ [—oo, +oo] be a & x ( S-measurable 
map. Then F is p x v-integrable if and only if: 


(i) for p-a.e. x G X the function y F (x , y) is v-integrable; 

(ii) the Junction x i—>■ f y \F(x, y)\dv(y) is p-integrable. 

If these conditions hold, we have 


/ 

JXxY 


F(x, y)d(p x v)(x, y) = 


/[/ 


F{x, y)dv{y) 


dp(x). (6.4) 


Notice that, strictly speaking, the function in (ii) is defined only out of 
a /i-ncgligible set; by /x-integrability of it we mean /x-integrability of 
any ^"-measurable extension of it (for instance we may set it equal to 0 
wherever f y \F(x, y)| dv(y) is not finite). 

Remark 6.7 (Finite products). The previous constructions extend with¬ 
out any difficulty to finite products of measurable spaces (X,- ,^", ). Name- 

ly, the product o -algebra .IF := X ( -X' in the cartesian product X := 
\s n 

X j X/ is generated by the rectangles 


{Ai x • • • x A„ : Aj e 1 < i < n]. 
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Furthermore, if /z, are o -finite measures in (X t , <#",), integrals with re- 

w n 

spect to the product measure /z = A , /z, are defined by 


L 


F(x ) dfx(x ) = 


/ /■■■/ 

Jx t Jx 2 Jx 


F(x 1 , ..., Xn)di± n (x n ) • • ■ dii 2 (x2)dii\{xi), 


and any permutation in the order of the integrals would produce the same 
result. Finally, the product measure is uniquely determined, in the o- 
finite case, by the product rule 


H (A x x • • ■ x A n ) = 


n 


n iXi(Ai) 

i=1 


Aj e ^i, l < i < n. 


It is also not hard to show that the product is associative, both at the level 
of cr-algebras and measures, see Exercise 6.1. 


6.2. The Lebesgue measure on M” 

This section is devoted to the construction, the characterization and the 
main properties of the Lebesgue measure in M", i.e. the length measure 
in M 1 , the area measure in M 2 , the volume measure in M 3 and so on. 

Definition 6.8 (Lebesgue measure in M"). Let us consider the measure 

space (M, SB (M), F £ 1 ), where . A 1 is the Lebesgue measure on (M)). 

n 

Then, we can define the measure space (M'\ X fi(M), with F£ n := 

i=i 

v, n 

X j ££ . We say that S£" is the Lebesgue measure on M". 

Since (see Exercise 6.2) 


^(JET) = X FB(R), 

i=1 

we can equivalently consider 2zf" as a measure in (M", FA (IRt")), forget¬ 
ting its construction as a product measure (indeed, there exist alternative 
and direct constructions of F£ n independent of the concept of product 
measure). 

As in the one-dimensional case, we will keep using the classical nota¬ 
tion 

/ f{x) dx \= f fl E d££ n E e ^(M"), / : M" -* M Borel 

Je J R" 

for integrals with respect to Lebesgue measure F£ n (or Riemann integrals 
in more than one independent variable). 
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In the computation of Lebesgue integrals, a particular role is sometimes 
played by the dimensional constant co n = «£?"(#(0, 1)) (so that mi — 2, 
a >2 = Jt , coj = 4tt/ 3,. ..A general formula for the computation of co n 
can be given using Euler’s V function: 


r (z) 



z > 0. 


Indeed, we have 


co n 


; r n ' 2 

r(| +1)' 


(6.5) 


A proof of this formula, based on the identity T(z + 1) = zT(z) (which 
gives also T (n) = (n — I)! for n > 1 integer) is proposed in Exercise 6.7. 

We are going to show that " is invariant under translations and rota¬ 
tions. For this we need some notation. For any ael" and any <5 > 0 we 
set 


Q(a , 5) : = {x e M" : a, < X; < a, + <5, V i = 1, ..., n} 


= X [a,-, fl, + 8). 

i=1 

Q(a, S) is called the 8-box with corner at a. For all N e N we consider 
the family 

£ n = {Q(2~ N k, 2~ n ) : k = (k u ..., k n ) e Z"}. 


It is also clear that each box in 23 n is Borel and that its Febesgue measure 
is 2 ~ nN . Now we set 

OO 

J2 = \Jg N . 

N =0 

It is clear that all boxes in 23 n are mutually disjoint and that their union 
is K". Furthermore, if N < M, Q e 22 n and Q' e £? M , then either 
Q! C Q or Q fl Q' = 0. If follows that if Q, Q' e 22, intersect, then 
one of the two sets is contained in the other one. 


Lemma 6.9. Let U be a non empty open set in M" . Then U is the disjoint 
union of boxes in 23. 

Proof. For any jc e U, let Q x e 23 be the biggest box such that x e 
Q x C U. This box is uniquely defined: indeed, fix an jc; for any m there 
is only one box Q x , m £ 23 m such that x e Q x m ; moreover, since U is 
open, for m large enough Q x , m C U ; we can then define Q x — Q x ,m 
where m is the smallest integer m such that Q x m c U. 
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This family {Q x }xeu is a partition of U, that is, for any a , y e U, either 
Qx = Q y or Q x n Q y = 0; indeed, if we suppose that Q x n Q y f 0, 
then one of the two boxes is contained in the other, say Q x C Q y . This 
leads to x e Q x C Q y C U, contradicting the definition of Q x unless 
Qx = Q y - □ 

From Lemma 6.9 it follows easily that the a-algebra generated by 
coincides with J?(R")- 

Proposition 6.10 (Properties of the Lebesgue measure). The follow¬ 
ing statements hold. 

(i) (translation invariance) For any E e 3&(K. n ), x e R" we have 
jSf n (E + a') = Ef"(E), where 

E+x = {y+x: ye R"}. 

(ii) If n is a translation invariant measure on (K' ! , 'El (R' ! )) such that 
H(K) < oo for any compact set K, there exists a number C /; > 0 
such that 


pL(E) = C^ n (E) V£ef(K"). 

(iii) (rotation invariance) For any orthogonal matrix R e L(R"; R") we 
have 

Ef n (R(E)) =S£ n (E) VEe@(R n ). 

(iv) For any T e L(M"; M") we have 

^ n (T(E)) = \detT\^ n (E) V£e^(R"). 

Proof. Fix a e R". The measures .if" (E ) and ,f£ n (E + a ) coincide on 
the 7T-system of boxes; thanks to Lemma 6.9, this n -system generates 
the Borel a-algebra, so that the coincidence criterion for measures stated 
in Proposition 1.15 gives that Ef n (E) = f/ Jn (E + a ) for all Borel sets E. 
Let us prove (ii). Let 0 O e and set = fi(Qo)- Since Q o is 
included in a compact set, we have C fl < oo. Since p is translation 
invariant, all boxes in i2 0 have the same /z measure. Now, let Q N e ff ; y. 
Since Qo is the disjoint union of 2~ nN boxes in T2 N which have all the 
same /z measure (again by the translation invariance) we have that 

r(Qn) = C^ n (Q N ). 

So, Lemma 6.9 gives that /z(A) = C IL .ff n (A) for any open set, and there¬ 
fore for any Borel set. 


90 Luigi Ambrosio, Giuseppe Da Prato and Andrea Mennucci 


Let us now prove (iii). By the translation invariance of «£?", the meas¬ 
ure /z(£) = .if" (/?(£)) is easily seen to be translation invariant (because 
R(E+z ) = R(E) + R(z)), hence Jt? n (R(E)) = CJ£'\E) for some cont- 
ant C. We can identify the constant C choosing E equal to the unit ball, 
finding C = 1. 

Finally, let us prove (iv). By polar decomposition we can write T = 
R o S with S = \JT* o T symmetric and nonnegative definite, and R 
orthogonal. Notice that on one hand |detr| = dct S (because det R e 
{—1, 1}) and on the other hand, by (iii) we have 

Sf"(T(E)) = Jgf n (R(S(E ))) = JSf n (S(£)). 

Hence, it suffices to show that 2zf"(S(,E)) = dct .S'.if''(£) for any sym¬ 
metric and nonnegative definite matrix S. By the translation invariance of 
J£"\S{E)) there exists a constant C such that //"' (S(E)) = C.if " ( E) for 
any Borel set E. In this case we can identify the constant C choosing as 
E a suitable n -dimensional cube: denoting by (e,) an orthonormal basis 
of eigenvectors of S, with eigenvalues a, > 0 (whose product is det .S’), 
choosing 

E = |j> g , : |cj| < 1 J , so that S(E) = | J^cqQg, : |c, | < 1 J , 

the rotation invariance of Jzf n gives ,5f''(£) = 1 and ,!z? n (S(E)) = 
ai ■ ■ ■ a n . Therefore C = det S and the proof is complete. □ 


6.3. Countable products 

We are here concerned with a sequence (X, , /x ( ), i = 1, 2, ..., of 

probability spaces. We denote by X the product space 

OO 

X := X X k 

k= 1 

and by x = (x k ) the generic element of X. 

We are going to define a a -algebra of subsets of X. Let us first in¬ 
troduce the cylindrical sets in X. A cylindrical set 1, U ,\ is a set of the 
following form 

I„,a = {x : Oi, ••• ,x n ) e A}, 

where n > 1 is an integer and A e X , ^ k . This representation is not 
unique; however, since 

OO 

In,A — A X X Xfc 

k=n-\-\ 


we have that I n ^ = I m , b with n < m implies B — Ax X n +\ x • • • x X m . 
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We denote by ^ the family of all cylindrical sets of X. Notice also that 

K.A = In,AC 

so that ^ is stable under complement. If I nA and I„ uB belong to 77 we can 
assume by the previous remarks that m — n, so that I, uA U I nB = T.aub 
belongs to 77. Therefore 77 is an algebra. 

The cr-algebra generated by 77 is called the product a -algebra of the 
or-algebras It is denoted by 

OO 

X & k . 

k= 1 

Now we define a function /x on 77, setting 


/^(7/i,a) 



(A), 


In,A ^ • 


( 6 . 6 ) 


This definition is well posed, again thanks to the fact that I n A = I mB 
with n < m when B = Ax X n+] x • • • x X m . It is easy to check that /x is 
additive: indeed, if I n A and I m B are disjoint, using the previous remark 
we can assume with no loss of generality that n = m, and therefore the 
equality /x(/„, A U I, kB ) = /x(/„, A ) + /x(/„, B ) follows by 


X 

k= 1 



(AU B) = 



(A) + 



(5). 


Theorem 6.11. The set function /x defined in (6.6) is o-additive on 
and therefore, by the Caratheodory theorem, it has a unique extension to 

a probability measure on (X. X , J^Y) that is denoted by 


OO 

X p k 

k= 1 


Proof. To prove the o'-additivity of /x it is enough to show the continuity 
of /x at 0, or equivalently the implication 

OO 

(E/) c c <o , ( Ej) nonincreasing, pt(Ej) > £ 0 > 0 ==*> |^| Ej 0. 

n=\ 

(6.7) 

In the following we are given a nonincreasing sequence (Ej) on 77 such 
that pt(E j) > e (l > 0. To prove (6.7), we need some more notation. We 
set 

OO 

x (n) = X x k 

k=n -\-1 
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and we define fi <n> on cylindrical sets of X ln> as in (6.6). Then, we con¬ 
sider the sections of Ej defined as 

Ej(xi) = {x (1) G X (1) : (x l5 x (1) ) G Ej) , X] G X u 

Ej(x i) is a cylindrical subset of X (]) and by the Fubini theorem we have 

fx(Ej)=[ ix m (Ej(xi))dfx 1 (xi)>e 0 >0, j > 1. (6.8) 

Jx i 

Set now 

Fj ,t = {*i G X, : n ll \Ej(x!)) > |} , j> 1. 

Then | is not empty and by (6.8) we have 

lx(E/)= I ix m (Ej(xi))diM(xi) + [ ix (l) (Ej(xi))diM(xi) 

Fj. I ZFf, 


£q 

< /;,(/••,,)- . 

Therefore fi i (F ; |) > e 0 /2 for all j > 1. 

Obviously (F^i) is a nonincreasing sequence of subsets of X\. Since 
fi\ is or -additive, it is continuous at 0. Therefore, there exists a\ G 
Hr Fj, i and so 

Ai (1) (£;(«!» > y, ./ > 1. (6.9) 

Consequently we have 

Ej( ai )?0, j > 1. (6.10) 


Now we iterate the procedure: for any x 2 G X 2 we consider the section 
Ej(ai, x 2 ) = {x (2) G X (2) : (oq, x 2 , x (2) ) g£,), j > 1. 

By the Fubini theorem we have 

/u, (1) (£,•(«!)) = f /x (2) (Fj(ai,x 2 ))d/z 2 (x 2 ). (6.11) 

Jx 2 

We set 

n. 2 = jx 2 G X 2 : ii (2> (Ej(a \, x 2 )) > y) , ./' > I. 
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Then by (6.9) and (6.10) we have 

< // :l '(/-.'/(u'i)) - f ix (2 \Ej(a u X2))diX2(x 2 ) 

^ «/ X 2 

= / n {2) (E j (a l ,x 2 ))dii2(x2) + / !x (2) (Ej{a u x 2 )) dii 2 (x 2 ) 

JFj.l J[Fj, 2 Y 

< M-2(-F7,2) + — ■ 

J 4 

Therefore ii 2 (Fj t2 ) > eo/4. Since (/y 2 ) is nonincreasing and /z 2 is < 7 - 
additive, there exists a 2 e X 2 such that 

fi 2 (Ej(a u a 2 )) j> 1, 

and consequently we have 


£y((ai,Q; 2 )) 7 ^ 0 . ( 6 . 12 ) 

Arguing in a similar way we see that there exists a sequence (a*) C X 
such that 

Ej(a i,...,a n )^0, for all 7 , n > 1, (6.13) 

where 

Ej(a 1 ,..., a„) = {x e X (n) : (aq,..., a n , x (n) ) e £,} , j, n > 1. 

Since E/ are cylindrical, this easily implies that ( a n ) e fX £ ; . There¬ 
fore Ej is not empty, as required. □ 

Exercises 

6.1 Let (X], J^i), (X 2 , & 2 ), (X 3 , ^ 3 ) be measurable spaces. Show that 

(^1 x ,^ 2 ) x ^3 = ^! x (TR 2 x ^ 3 ). 

If we are given measures /q in J 2 ",-, i = 1, 2, 3, show also that (/x 1 x /x 2 ) x /X 3 = 
Mi x (M 2 x /r 3 ). 

6.2 Let us consider the measurable spaces (M, 38 (R)), (R", 38 (R' ! )). Show that 


38 ( R' ! ) = X 38 ( R). 

1 = 1 

Hint: to show the inclusion C, use Lemma 6.9. 

6.3 Let 38 n be the a -algebra of Lebesgue measurable sets in R ,J . Show that 

x jSfj C Jz? 2 . 
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Hint: to show the strict inclusion, consider the set E = F x {0}, where Fct 
is not Lebesgue measurable. 

6.4 Show that the product <r-algebra is also generated by the family of products 
A | Aj where A; e -A and A, ^ X, only for finitely many 

6.5 Writing properly ££ 3 as a product measure, compute .A 3 (7’). where 

T = |(x, y, z) : x 2 + y 2 < r 2 and y 2 + z 2 < r 2 J . 


6.6 [Computation of co n ] Find a recursive formula linking o> n to t» n - 2 , and use 
it to show that a> 2 k — n k /k\ and u> 2 k+\ — 2 i+1 n k / (2k + 1 )!!, where (2k + 1 )!! 
is the product of all odd integers between 1 and 2k + 1. Hint: use the Fubini- 
Tonelli theorem. 

6.7 Use Exercise 6.6 and the identities T(l) = 1, F(1/2) = JYt and F(z +1) = 
zF(z) to show (6.5). 

6.8 Let /x and v be a -finite measures on ( X , ■'¥) and (Y, C#) respectively and let 
X — jjt x v. Let S' = x (S)x, as defined in Definition 1.12, and let £ be 
the extension of X to S. Show this version of the Fubini-Tonelli Theorem 6.4: 
for any (^’-measurable function F : X x Y —*■ [0, + 00 ] the following statements 
hold: 

(i) for /x-a.e. x e X the function y i-> F (x , y ) is u-measurable; 

(ii) the function x i-> 1 F(x, y)dv(y), set to zero at all points x such that 

Jy 

y F(x, y) is not v-integrable, is /x-measurable; 

(iii) /x/r f (^ 3 ') 4 W 4 W = f Xx y F ( x ’ T) d S(x, y)- 

6.9 Using the notation of the Lubini-Tonelli theorem, let X = Y = [0,1], 
& = Sf = ^ ([0, 1]), let /x be the Lebesgue measure and let v be the counting 
measure. Let D = {(x, x) : x e [0, 1]} be the diagonal in X x Y; check that 
f x v(D x ) <7/x(x) ^ f Y n(Dy)dv(y). 

6.10 ★ Let (//,) be converging to / in kUx x Y, /x x v). Show the existence 
of a subsequence h(k) such that fh(k) (x, ■) converge to /(. x, ■) in L l (Y, v ) for 
/x-a.e. x e X. Show by an example that, in general, this property is not true for 
the whole sequence. 


6.4. Comparison of measures 

In this section we study some relations between measures in a measurable 
space ( X , YS ). 

The first (immediate) one is the order relation: viewing measures as 
set functions, we say that /z < v if /i(B) < v(B) for all B e &. It is not 
hard to see that the space of measures endowed with this order relation is 
a complete lattice (see Exercise 6.13): in particular 

fiWv(B) = sup {/x(Ai) + v(A 2 ) : A\, A 2 6 Y?, (Ai, A 2 ) partition of B] 
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and 

fiAv{B) = inf {/x(Ai) + v(A 2 ) : A[, A 2 e &, (A\, A 2 ) partition of B}. 

Another relation between measures is linked to the concept of product of 
a function by a measure. 

Definition 6.12. Let /x be a measure in (X, TF) and let / el 1 (X, LP , /x) 
be nonnegative. We set 


ft±(B) := f f dii 
J B 


VB e 


(6.14) 


It is immediate to check, using the additivity and the continuity properties 
of the integral, that //x is a finite measure. Furthermore, the following 
simple rule provides a way for the computation of integrals with respect 
to fn : 

I hd(fii)= f hf dii, (6.15) 

Jx Jx 

whenever h is ^"-measurable and nonnegative (or hf is /x-integrable, 
see Exercise 6.11). It suffices to check the identity (6.15) on character¬ 
istic functions h = l B (and in this case it reduces to (6.14)), and then 
for simple functions. The monotone convergence theorem then gives the 
general result. 

Notice also that, by definition, f/i(B ) = 0 whenever /x( B) = 0. We 
formalize this relation between measures in the next definition. 

Definition 6.13 (Absolute continuity). Let /x, v be measures in &. We 
say that v is absolutely continuous with respect to /z, and write v <<C /x, if 
all /x-negligible sets are v-negligible, i.e. 


li(A) = 0 


v(A) = 0. 


For finite measures, the absolute continuity property can also be given in 
a (seemingly) stronger way, see Exercise 6.14. 

The following theorem shows that absolute continuity of v with respect 
to i± is not only necessary, but also sufficient to ensure the representation 
v = f/i. 

Theorem 6.14 (Radon-Nikodym). Let /x and v be finite measures on 
(X, -X) such that v <<C /x. Then there exists a unique nonnegative p G 
L 1 (X, /x) such that 


v(E) = / p(x)dfi(x) 

J E 


VE e 


(6.16) 
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We are going to show a more general result, whose statement needs two 
more definitions. We say that a measure p is concentrated on a im¬ 
measurable set A if p{X \ A) = 0. For instance, the Dirac measure 8 a 
is concentrated on {a}, and the Lebesgue measure in R is concentrated 
on the irrational numbers, and f p is concentrated (whatever p is) on 

if* 0}. 

Definition 6.15 (Singular measures). Let ii, v be measures in (X , JF). 
We say that /z is singular with respect to v, and write p ± v, if there exist 
disjoint ^-measurable sets A, B such that /z is concentrated on A and 
v is concentrated on B. 

The relation of singularity, as stated, is clearly symmetric. Flowever, it 
can also be stated in a (seemingly) asymmetric way, by saying that /z _L v 
if p is concentrated on a v-negligible set A (just take B = A c to see the 
equivalence with the previous definition). 

Example 6.16. Let X = R, = A/i (R), p the Lebesgue measure on 
(X, AX ) and v — 8 X0 the Dirac measure at x 0 e R. Then p is concentrated 
on A := R \ {xo}, whereas v is concentrated on 6 := ji 0 ). So, p and v 
are singular. 

Theorem 6.17 (Lebesgue). Let p and v be measures on {X, AX), with \i 
a -finite and v finite. Then the following assertions hold. 

(i) There exist two unique finite measures v a and v s on ( X , -X ) such that 

v = v a + v s , v a < /z, v s -L /X. (6.17) 

(ii) There exists a unique p G L l (X, .X, /z) such that v a = p/z. 

(6.17) is called the Lebesgue decomposition of v with respect to /z. The 
function p in (ii) is called the density of v with respect to /z and it is 
sometimes denoted by 

d v 

^ dp 

Radon-Nikodym theorem simply follows by the Lebesgue theorem noti¬ 
cing that, in the case when v p the uniqueness of the decomposition 
gives v a = v and v s — 0 , so that v — v a = pp. 

Proof of Theorem 6.17. . We assume first that also p is finite. Set 
A = p + v and notice that, obviously, p <$C A and v A. Define a linear 
functional F on L 2 (X, &, A) setting 

F(tp) f cp(x)dv(x), tp e L 2 (X, AX, A). 

Jx 
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The functional F is well defined and bounded (and consequently con¬ 
tinuous) since, in view of the Holder inequality, we have 

\F{(p)\< f \(p{x)\dv{x) < f \(p{x)\dX{x) < [A(X )] 1/2 \\(P\\lHx,&,x)- 
Jx Jx 

Now, thanks to the Riesz theorem, there exists a unique function / e 
L 2 (X, &, X) such that 


/ (p(x)dv(x)= / f(x)(p(x) dX(x) 

Jx Jx 


V<p e L 2 (X, &,k). (6.18) 


Setting (p = l E , with E e &, yields 

v(E) = f f(x)dl(x)> 0, 

J E 

which implies, by the arbitrariness of E, j'(x) > 0, A-a.e. and, in par¬ 
ticular, both /i-a.e. and v-a.e. In the sequel we shall assume, possibly 
modifying / in a /.-negligible set, and preserving the validity of (6.18), 
that / > 0 everywhere. By (6.18) it follows 


L 


<P(x)(l-f(x))dv(x) 


L 


f(x)(p(x)dfi(x) 


Setting (p = l E , with E e &, yields 


V(p e L 2 (X,&,X). 

(6.19) 


f (1 - f(x))dv(x)= [ f(x)dfi(x)> 0 

J E Je 

because / > 0. Thus, being E arbitrary, we obtain that fix) < 1 for 
v-a.e. x e X. Set now 

A := {x e X : 0 < f(x) < 1}, B := {x eX : f(x) > 1}, 

so that (A, B) is a ^"-measurable partition of X, and 

v a (E) := v(E n A), v s (E) \= v(E D B) VEe^, 

so that v a = l A v is concentrated on A, v v = l B v is concentrated on B 
and v = v a + v*. 

Then, setting in (6.19) <p = t B , we see that 


fx(B) < f f dpi = f (l — f) dv = 0 
J B J B 
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because / = 1 v-a.e. on B. It follows that v s is singular with respect to 
/x. 

We show now that the existence of p such that v a = pin. Heuristically, 
this can be obtained choosing in (6.19) the function <p = (1 — /) _ 1 l£ nA , 
but since this function need not to be in L 2 (X, ^ ,X) we argue by ap¬ 
proximation: set in (6.19) 


cp(x) = (1 + fix) H-b f n (x))l EnA (x) 

where n > 1 and E e &. Then we obtain 


f (1 -f n+ \x))dv{x)= f [f(x) + f 2 (x) + --- + f n+1 (x)\dii(x). 
JEHA J ECiA 

Set p(x) =0 for x e B and 

p(x) := lim [fix) + fix) H-b f n+ \x)] = , x e A. 

n^oo 1 - fix) 

Then, by the monotone convergence theorem it follows that 

v a iE) = viE H A) = / pix)dinix)— / p(x)d/x(x). 

Jeha Je 

Setting E = X we see that p e L l [X, &, /x), and the arbitrariness of E 
gives that v a = p/x. 

Now we consider the case when /x is cr-finite. In this case there exists 
a sequence of pairwise disjoint sets ( X n ) c & such that 


X = X„ with iiiX n ) < oo. 


n =0 


Let us apply Theorem 6.17 to the finite measures /x„ = t x n, V n ~ ^X„ V - 
For any n e N let v n = (v „) 0 + (v„) s = p„/x„ + (v„) s be the Lebesgue 
decomposition of v n with respect to /x„. Now, set 


v n := 


•— ^ ' (^i)ai V s •— ^ P ■— ^ '[ Pn 1 > 


77=0 


77=0 


77=0 


k 


^ ' (Vn)a 

77=0 



77=0 


^X n 


V, 


Since 
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we can pass to the limit as k —> oo to obtain that v a and y v are finite 
measures, and v = v a + v s . Moreover, for any E e ^ we have, using 
the monotone convergence theorem, 


OO OO /» 


n =0 n=0 ^ E 


P oo 

/ £ E* 

JE n =0 


0)1 v d/XO) = p(x) d/x O). 


L 


So, v a /x, and setting E = X we see that p is integrable with respect 
to /x. Finally, it is easy to see that y s _L /x, because if we denote by 
B„ e ^ /x-ncgligiblc sets where (v„). s are concentrated, we have that ly 
is concentrated on the /x-negligible set U„ B„. 

Finally, let us prove the uniqueness of v a and v s : assume that 

V = v a + v s = v' a + v' s 

and let B , B' be /x-negligible sets where v s and v' are respectively con¬ 
centrated. Then, as B U B' is /x-negligible and both v s and u' are con¬ 
centrated on B U B', for any set E e & we have 


v„(E) = v s (Er\(BVJB')) = v(ED(BUB')) = vJ(£n(fiU5')) = v'(£). 


It follows that ly = y' and therefore v a — v' □ 

The interested reader can have a look at a different proof of The¬ 
orem 6.17 independent of Hilbert space theory, and based on three aux¬ 
iliary variational principles; it turns out that the density / of v a is the 
maximizer in the problem 


sup 



f dii : 


//x < y 


( 6 . 20 ) 


See Exercise 6.17 and Exercise 6.18 for more details. 


Remark 6.18. If /x is not a -finite then the Lebesgue decomposition does 
not hold in general. Consider for instance the case when X = [0, 1], 
& — &([0, 1]), /x is the counting measure and v = Jzf 1 . Then v <$C /x 
(as the only /x-negligible set is the empty set) but there is no p : [0, 1] —> 
[0, oo] satisfying 


v(E) = / p<7/x. 
Je 


Indeed, this function should be /x-integrable and therefore it can be non¬ 
zero only in a set at most countable. 
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Exercises 

6.11 Show that a -measurable function h is //x-integrable if and only if fh 
is /x-integrable. 

6.12 Show that (//x) V(g/x) = (/Vg)/x and (//x) A (g/x) = (/Ag)/x whenever 
/, g e L l (X, &, /x) are nonnegative. 

6.13 Let {/x,}/ 6 / be a family of measures in (X, &). Show that 


/x(B) := inf 


OO 

EwwW 

k=0 


i : N ->• /, 


( Bk ) countable ^"-measurable partition of B 


is the greatest lower bound of the family {/x,}; s /, i.e. \i < /x; for all i e I and it 
is the largest measure with this property. Show also that 


/x(Z?) := sup 


^/Lot)(£/l) : i : N ->■ I, 


k=0 

(Bk) countable ^"-measurable partition of B 


is the smallest upper bound of the family {/x;}; € /, i.e. /x > /x, for all i e I and 
it is the smallest measure with this property. 

6.14 Let /x, v be measures in (X, &) with v finite. Then v <<C /x if and only if 
for all e > 0 there exists S > 0 such that 

A e &, /x(A) < 8 ==> v(A) < e. 

6.15 Assume that v <$C /x and that v _L /x. Show that v = 0. 

6.16 Assume that a < /x + v and that er _L v. Show that er < /x. 

6.17* Prove Theorem 6.14 in the following two steps: 

(1) Show that a maximizer / in (6.20) exists. 

(2) Setting a — v — //x > 0, a satisfies 

r > 0 , Se < cr => /x(5) = 0 . ( 6 . 21 ) 

Then, apply Exercise 6.18 to conclude that a _L /x. 

6.18* ★ Let /x, cr be nonnegative finite measures satisfying (6.21). Show that 
a _L /x. Hint: first show that 

inf {/x(A) : Ae a is concentrated on A} 

has a solution A. Assuming by contradiction that /x(A) > 0 (otherwise we are 
done), show that 


B C A, /x(5) > 0 


<j(B) > 0. 


( 6 . 22 ) 
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Then, show that the numbers 

%h := sup J/r(5) : & 3 B C A, l B p > 2 h t B o J 

are infinitesimal as h -* oo, that the supremum is attained at Bh, and that 

p(C) < 2 h o (C) for all sets C C A \ 5/,. (6.23) 

Finally choose t = 2~ h , with h sufficiently large so that £/, < p(A) and B = 
A\Bk, to get a contradiction with (6.21). 


6.5. Signed measures 

Let ( X, , ( ¥) be a measurable space. In this section we see how the concept 
of measure, still viewed as a set function, can be extended dropping the 
nonnegativity assumption onAn- p(A). 

We recall that sequence (E,) C & of pairwise disjoint sets such that 
Ei = E is called a countable &-measurable partition of E. 

Definition 6.19 (Signed measures and total variation). A signed mea¬ 
sure p in ( X , J£") is a map p : TP —> R such that 

OO 

zd E ) = Yp(Ei ) 

/=o 

for all countable ^"-measurable partitions (£,) of E. 

Notice that the series above is absolutely convergent by the arbitrari¬ 
ness of (Ei): indeed, if o : N —»■ N is a permutation, then ( E a(i -)) is still 
a partition of E, hence 


= Y^ Ea ^- 

i=0 /=0 

This implies that the series is absolutely convergent. 

Let p be a signed measure. Then we define the total variation |/x| of p 
as follows: 


I OO 

Y 

'= o 


(Ei) ^"-measurable partition of E j , 

E e 


Proposition 6.20. Let p be a signed measure and let \p \ be its total vari¬ 
ation. Then \p\ is a finite measure on (X, &). 
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Proof. It is immediate to check that |/z| is a nondecreasing set function. 
Step 1 . If A, are disjoint, we have 

\H\(AUB) = |/z|(A) + |/z|(B). 

Indeed, let E = A U B and let (£,) be a countable ^"-measurable parti¬ 
tion of E. Set 


Aj = A n Ej, Bj = B n Ej, j e N. 

Then (Ay) is a countable ^"-measurable partition of A and (Bj) a count¬ 
able ^"-measurable partition of B and we have Ej = A ; -U By. Moreover, 

OO OO OO 

Y MEj)\ < \E(Aj)\ + Y < N(A) + \fi\(B), 

j= 0 7=1 7=0 

which yields |/x|(A Ufi)< |/x|(A) + |/x|(B). 

Let us prove the converse inequality, assuming with no loss of gener¬ 
ality that | /x|(A U B) < oo. Since both |/x|(A) and |/x|(B) are Unite, for 
any s > 0 there exist countable ^"-measurable partitions (A|) of A and 
(B|) of B such that 

OO OO 

Y \li(Al)\ > I At I (A) - X] 1/dCl > N (B) - 

k=0 1 k =0 1 

Since (A|, B|) is a countable -measurable partition of AU B, we have 
that 


l/x|(A U B) > ^d/t(A|)| + \li(B £ )\) > I At | (A) + |/z|(B) - e. 

it=i 

By the arbitrariness of e we have |/x|(A U B) > |/x|(A) + |/x|(B). 

Step 2. |/x| is a -additive. Since |/x| is additive by Step 1, it is enough 
to show that |/x| is er—subadditive, i.e. |/x(A)| < Eo° M(A;) whenever 
(A,) c ^ is a partition of A. This can be proved arguing as in the first 
part of Step 1, i.e. building from a partition (Ej) of A partitions (Ej fl A, ) 
of all sets A,-. 

Step 3. |/x|(A) < oo. Assume by contradiction that |/x|(X) = oo. Then 
we claim that 


there exists a partition X = A U B such that 
|/x(A)| > 1 and |/x| (B) = oo. 


(6.24) 
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By the claim the conclusion follows since we can use it to construct by 
recurrence (replacing X with B and so on), a disjoint sequence (A„) c & 
such that |/x(A„)| > 1. Assume, to fix the ideas, that /x(A„) > 1 for 
infinitely many n, and denote by E the union of these sets: then, the 
or-additivity of /x forces n(E) = +oo, a contradiction. Analogously, if 
/x(A„) < —1 for infinitely many n, we find a set E such that /i(E) — 
—oo. 

Let us prove (6.24). By the assumption |/x|(A) = oo it follows the 
existence of a partition (X„) of X such that 

OO 

X>(X„)I >2(l + |/x(A)|). 

n =0 

Then either the sum of those /x(A„) which are nonnegative or the absolute 
value of the sum of those /x(X„) which are nonpositive is greater than 
1 + |/x(X)|. To fix the ideas, assume that for a subsequence (X n ^)) we 
have /x(X„(£)) > 0 and 


2>(*«®) > l + \n(X)\. 

k=0 

Set A — X n (k) and B = A c . Then we have |/x(A)| > 1 + \fi(X)\ and 
\H(B)\ = | ix(X) - /x(A)| > |/x(A)| - |/x(A)| > 1. 

Since 

\fi\(X) = \ii\(A) + \n\(B) = oo, 

either |/x| (B) = +oo or |/x|(A) = +oo. In the first case we are done, in 
the second one we exchange A and B. So, the claim is proved and the 
proof is complete. □ 

Let ii be a signed measure on {X, J 5 "). We define 

,1 f 

A := - (|/x|+p,), ix := - (|/x| - n). 


so that 

fx =/x + — fx~ and |/x| = /x + + fx~. (6.25) 

The measure fx + (respectively n~) is called the positive part (respectively 
negative part) of /x and the first equation in (6.25) is called the Jordan 
representation of /x. 
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Remark 6.21. It is easy to check that Theorems 6.17 and 6.14 hold when 
v is a signed measure: it suffices to split it into its positive and negative 
part, see also Exercise 6.19. 

The following theorem proves also that p + and /z _ are singular, and 
provides a canonical representation of /r ± as suitable restrictions of ±/z. 

Theorem 6.22 (Hahn decomposition). Let p be a signed measure on 
(X, ■L ; ) and let /i + and p~ be its positive and negative parts. Then there 
exists a TP -measurable partition {A. B) of X such that 

p + (E) = p(ADE) and p~{E) = -p(BCE) V£ e (6.26) 

Proof. Let us first notice that /z <<C |/z|. Thus, by the Radon-Nikodym 
theorem, there exists h e L ] (X, 2^", |/z|) such that 

/z(£) = J hd\n\ XEe&. (6.27) 

Let us prove that \h(x)\ = 1 for |/z|-a.e. x e X. Indeed, set 

Er~{xeX\ h(x) > 1}, F\ \= {x e X : h{x) < -1} 

We first show that |/z|(£i) = |/tz|(Fi) = 0. Since we have 

1 /zlOEO >/z(£i) = [ hd\p\ > 1/zKLj), 

J Ei 

and the second inequality is strict if | p. \ ( E \) > 0, we have that | /z | (E \) = 
0. In a similar way one can prove that |/z|(F|) = 0, so that \h\ < 1 |/z|— 
a.e. in X. Now, let r e (0, 1) and set 


G r := {x e X : \h(x)\ < r}. 

Let (G, jt) be a countable ^-measurable partition of G r . Then we have 




L 


h d |/z| 



\h\d\p\ < r\fi\(G r ,k). 


Therefore 

OO 

J2\^G r .k)\ <r\fi\(G r ), 

k=0 

which yields, by the arbitrariness of the partition of G r , |/z|(G, ) < 
r\p\(G r ). Thus |/z|(G r ) = 0 and letting r f lwe obtain that |/z|({|h| < 
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1}) = 0. Hence, possibly modifying h in |/z|-negligible set, we can as¬ 
sume with no loss of generality that h takes its values in {—1, 1}. 

Now, to conclude the proof, we set 


A:={x eX : h(x) = 1}, 
Then for any E e AP we have 
1 


B :={x eX : h(x) = -1}. 


and 


fC( E) = - (M(E) + ME)) = \ f (1 + h)d\M 


= j ‘ 


hd\pt\ = /i(E D A), 


Mis) = i (l«l(£) - ME)) ='- [(!-h)d\,,.\ 


/ ^ 


hd\ii\ = —fi(E D B ). 


□ 


Exercises 

6.19 Using the decomposition of v in positive and negative part, show that Le- 
besgue decomposition is still possible when /i is a-finite and v is a signed meas¬ 
ure. Using the Hahn decomposition extend this result to the case when even /i 
is a signed measure. Are these decompositions unique? 

6.20 Show that \fn\ = |/|/r for any / e L l (X, S’, /r). 


6.6. Measures in M 

In this section we estabilish a 1-1 correspondence between finite Borel 
measures in M and a suitable class of nondecreasing functions. In one 
direction this correspondence is elementary, and based on the concept of 
repartition function. 

Given a finite measure \± in (M, SS (M)), we call repartition function of 
!± the function F : M —> [0, +oo) defined by 

F(x) := pt ((—oo, x]) r el. 

Notice that obviously (1) F is nondecreasing, right continuous, and satis¬ 
fies 

lim F(x ) = 0, lim F(x) e [0, +oo). (6.28) 

x—> —oo x —>+oo 

Moreover, F is continuous at x if and only if x is not an atom of /z. 


The arguments are similar to those used in Section 2.4.2, in connection with the properties of the 
function t (->• /x({(p > t}) 
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The following result shows that this list of properties characterizes the 
functions that are repartition functions of some finite measure /z; in addi¬ 
tion the measure is uniquely determined by its repartition function. 

Theorem 6.23. Let F : M — > [0, +oo) be a nondecreasing and right 
continuous function satisfying (6.28). Then there exists a unique finite 
measure /z in (M, fid (M)j such that F is the repartition function of /i. 

Proof. The proof follows the same lines of the construction of the Le- 
besgue measure in Section 1.6, with a simplification due to the fact that 
we can also consider unbounded intervals (because we are dealing with 
finite measures). We set 

y := {(a, b\ : a e [—oo, +oo), b e M, a < b} 

and denote by si the ring generated by fiF\ it consists, as it can be easily 
checked, of all finite disjoint unions of intervals in .i. We define, with 
the convention F(—oo) = 0, 


/z((a, b]) := F(b) - F(a) W(a, b ] e J. (6.29) 


This dehnition is justified by the fact that, if /z were a measure and F 
were its repartition function, (6.29) would be valid, because (a, b] = 
(—oo, b ] \ (—oo, a\. Then we extend /z to si with the same mechan¬ 
ism used in the proof of Theorem 1.21, and check that /z is additive on 
si. Also, the same argument used in that proof shows that /z is even a- 
additive: in order to prove that /z(F) = /z (F, ) whenever F and all F, 
belong to si one first reduces to the case when F = (a. b \ belongs to -i\ 
then, one enlarges F, to F[ e si with /z(F/) < /z(F,) + <52 - ' and, using 
the fact that all intervals \a ', b\ with a' > a are contained in a finite union 
of the sets F/, obtains 


OO 


OO 


d((a', b]) < J2 MW ^ 28 + J2 


Letting first 8 l 0 and then a' \. a we obtain the cr-subadditivity property 
/z(F) < ji (F,), and the opposite inequality follows by monotonicity. 

By the Caratheodory theorem /z has a unique extension, that we still 
denote by /z, to 3d (M) = a (si). Setting a = — oo and letting b tend to 
Too in the identity (6.29) we obtain that /z(M) = F(Too) e M. From 
(6.29) with a = —oo we obtain that the repartition function of /z is F. □ 
Given a nondecreasing and right continuous function F satisfying (6.28), 
the Stieltjes integral 
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is defined as f f dp F , where fi F is the finite measure built in the pre¬ 
vious theorem. The notation dF is justified by the fact that, when / = 
Xw z i\ai,biV we have (by the very definition of /z F ) 

[ fdF= f f d\i F — y~]zi(F(bi) — F{a,i)). 

Jr Jr ~t 

This approximation of the Stieltjes integral will play a role in the proof 
of Theorem 6.28. 

6.7. Convergence of measures on M 

In this section we study a notion of convergence for measures on the 
real line that is quite useful, both from the analytic and the probabilistic 
viewpoints. 

Definition 6.24 (Weak convergence). Let (/x/ ? ) be a sequence of finite 
measures on M. We say that (/x/,) weakly converges to a finite measure /z 
on M if the repartition functions F h of /z/ ; are pointwise converging to the 
repartition function F of /x on a co-countable set, i.e. if 

lim /x/, ( —oo, x] ) = /x ((—oo, x |) with at most countably many exceptions. 

h—>oo 

(6.30) 

Since the repartition function is right continuous, it is uniquely determ¬ 
ined by (6.30). Then, since the measure is uniquely determined by its 
repartition function, we obtain that the weak limit, if exists, is unique. 
The following fundamental example shows why we admit at most count¬ 
ably many exceptions in the convergence of the repartition functions. 

Example 6.25. [Convergence to the Dirac mass] Let p e C°°(M) be 

a nonnegative function such that f R pdx = 1 (an important example is 
the Gauss function (2jr)~ 1 ^ 2 e~ x2 ^ 2 ). We consider the rescaled functions 
p h (x) = hp(hx) and the induced measures /z/, = p/,«Sf *, all probability 
measures. Then, it is immediate to check that /x/, weakly converge to <5 0 : 
for jc > 0 we have indeed 

/ .X phx 

Phiy) dy = p(y)dy -* 1 

-oo J —OO 

because hx -x +oo as h —»■ +oo. An analogous argument shows that 
Pi, ((— oo, x]) —> 0 for any x < 0. If p is even, at jc = 0 we don’t 
have pointwise convergence of the repartition functions: all the reparti¬ 
tion functions F h satisfy F h ( 0) = 1/2, while F( 0) = 1. 
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Weak convergence is a quite flexible tool, because it allows also an op¬ 
posite behaviour, the approximation of continuous measures (i.e. with no 
atom) by purely atomic ones, see for instance Exercise 6.21. 

From now on we will consider only, for the sake of simplicity, the 
case of weak convergence of probability measures. Before stating a com¬ 
pactness theorem for the weak convergence of probability measures, we 
introduce the following terminology. 

Definition 6.26 (Tightness). We say that a family of probability meas¬ 
ures {/x, }i € r in M is tight if for any s > 0 there exists a closed interval 
i Ci such that 


tn(R\J)<e Vi el. 

Clearly any finite family of probability measures is tight. One can also 
check (see Exercise 6.24) that {/z, },- 6 / is tight if and only if 

lim Fj(x) = 0, lim Ffx) = 1 uniformly with respect to i e /, 

x —>—oo x—>+oo 

(6.31) 

where F t are the repartition functions of /x,. Furthermore, (see Exer¬ 
cise 6.25) any weakly converging sequence is tight. Conversely, we have 
the following compactness result for tight sequences: 

Theorem 6.27 (Compactness). Let (/x/,) be a tight sequence of prob¬ 
ability measures on M. Then there exists a subsequence (/x/,^) weakly 
converging to a probability measure /x. 

Proof. We denote by Fj, the repartition functions of /x/ 2 . By a diagonal 
argument we can find a subsequence (Fhqq) pointwise converging on Q. 
We denote by G the pointwise limit, obviously a nondecreasing function. 
We extend G by monotonicity setting 

G(x) := sup{G(g) : q e Q, q < x] rel 

and let E be the co-countable set of the discontinuity points of G. 

Let us check that is pointwise converging to G on K \ E: for 
x f E we have indeed 

limsup F h ( k) {x) < inf lim sup F h(kj (q) = inf G(q) = G(x), 

k^oo q£®,q>x qeQ,q>x 

and analogously 

liminf F/ !( £)(x) > sup liminf Fhqqiq) — sup G{q) — G(x). 

k - * 00 qeQ,q<x qeQ,q<x 
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Since (/x*) is tight, we have also 

lim F h {x) = 0, lim F h (x) = 1 

x —>—oo x—>+oo 

uniformly with respect to h, hence G(—oo) = 0 and G(+oo) = 1. 
Notice now that the nondecreasing function 

F(x) := lim G(y) 

y±x 


is right continuous, and still satisfies F(— oo) = 0 and F(+oo) = 1, 
therefore (according to Theorem 6.23) F is the repartition function of a 
probability measure /x. Since F — G on M \ E, we have Fh(jc) —> F 
pointwise on M \ E, and this proves the weak convergence of p-h(k) to /x. 

□ 

The following theorem provides a characterization of the weak conver¬ 
gence in terms of convergence of the integrals of continuous and bounded 
functions. 

Theorem 6.28. Let /x;,, /x be probability measures in M. Then /x* weakly 
converge to /x if and only if 

lim [ gdph = f gdp Vg G Cfo(M). (6.32) 

h^oo J i J R 

Proof. Assuming that /x/, —> /x weakly, we denote by F/, and F the cor¬ 
responding repartition functions and fix g e C/,(M). Let M = sup |g| 
and s > 0. By Exercise 6.25 the sequence (/x/ 2 ) is tight, so that we can 
find t > 0 satisfying /x* (M \ (—t, t]) < e for any h e N; we may as¬ 
sume (possibly choosing a larger t ) that also /x (K \ (—t, f]) < e and that 
both —t and t are points where the repartition functions are converging. 
Thanks to the uniform continuity of g in [— t, t \ we can find 8 > 0 such 
that 

x,ye[-t,t], |x — y\ < 8 =>- |g(x) - g(y)| < e. (6.33) 

Hence, we can find points t\, ..., t„ in [— t, f] such that t\ = —t, t„ = t, 
there is convergence of the repartition functions in all points t,, and t- l+ \ — 
ti < 8 for i — 1, — 1. By (6.33) it follows that sup ( _, |g — f\ < s, 

where 

II — 1 

/ : = 

i =1 
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Splitting the integrals on R as the sum of an integral on (—t,t] and an 
integral on (—t, 1 | <; we have 



/ 


f dfj. h 


< Ms + s = (M + l)g 


V/z e N, (6.34) 


and analogously 


( gdn- f 

Jr J(-tj] 


fd/i 


Since 


n n— 1 

/ / dii h = V' g(ti) [F h (t i+ i) - F h (*;)] 

J (-tA ,-=i 

n— 1 p 

Y J S{t i )[F(t i+1 )-F(t i )]= / 

;=i 


< Mg + £ = (M + l)g. 


(6.35) 


fdfi. 


adding and subtracting f ( _ t / d/x/,, and using (6.34) and (6.35), we con¬ 
clude that 


lim sup 

h — > oo 



< (M+ 1)£. 


Since £ is arbitrary, (6.32) is proved. 

Conversely, assume that (6.32) holds. Given x e R, define the open 
set A = (—oo, x ); we can easily find (gk) c C/,(M) monotonically con¬ 
verging to t A and deduce from (6.32) the inequality 


lim inf /x/,(A) > 

h—>oo 


sup lim inf / gkd/x/, = sup / 

£eN h ^°° JR keN Jr 


gk d/x = H(A). 


Analogously, using a sequence (gk) c C/,(1R) such that yy- 4- lc» with 
C = (—oo, x], we deduce from (6.32) the inequality 

limsup/x/ ? (C) < inf lim sup / gkd/Xh = inf / gkd/x = fi(C). 

h—>-oo h —^-oo Jr Jr 

Therefore we have convergence of the repartition functions for any xel 
such that /x(A) = /r(C), i.e. for any x that is not an atom of fi. We 
conclude thanks to Exercise 1.5. □ 

Notice that in (6.32) there is no mention to the order structure of R, 
and only the metric structure (i.e. the space C/,(R)) comes into play. In 
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a general context, of probability measures on a metric space (X, d) en¬ 
dowed with the Borel a -algebra SS ( X ), we say that /x/ ? weakly converge 
to /x if 

lim / gdiih= / gdfi for any function g e Cb(X). 

h^ooJx Jx 

Exercises 

6.21 Show that the probability measures 



weakly converge to the probability measure 1[ 0 

6.22 Let Fh : R —>• R be nondecreasing functions pointwise converging to 
a nondecreasing function F : R —> R on a dense set I) C R. Show that 
/*’/, (x) —> F(x) at all points x where F is continuous. 

6.23 Consider all atomic measures of the form 



where he N and «_/,, ..., a/, > 0. Show that for any finite Borel measure /x 
in R there exists a sequence of measures (/x/,) of the previous form that weakly 
converges to /x. 

6.24 Show that a family {/x, }, e / of probability measures in R is tight if and only 
if (6.31) holds. 

6.25 Show that any sequence (/x/,) of probability measures weakly convergent 
to a probability measure is tight. Hint: if /x is the weak limit and e > 0 is 
given, choose an integer n > 1 such that /x([l — n, n — 1 ]) > 1 — e and points 
x e (— n, 1 — n) and y e (n — 1 , n) where the repartition functions of /x/, are 
converging to the repartition function of /x. 

6.26 We want to extend what was shown in this section from the realm of prob¬ 
ability measures to that of finite measures. Let (/x/,), /x be finite positive Borel 
measures on R, and let /*),, F be their repartition functions. Consider the fol¬ 
lowing implications: 

(a) lim/, / R g dji h = / R g d/x Wg e C/,(R) (that is, (6.32)); 

(b) lim/, / R g dfx-h = / R g dii Vg e C f (R); 

(c) Fh converge to F at all points where F is continuous; 

(d) Fh converge to F on a dense subset of R; 

(e) lim/, /x/,(R) = /x(R); 

(f) (Fh) is tight. 

Find an example where (b) holds but (a), (c), (e) do not hold and prove the 
following implications: a =$> b, e, a => c, d O c, b A e => c, d A e =>■ /, 
dAf => e,dAf => a. As a corollary, if (e) holds (as ithappens in the case when 
all fih and /x are probability measures) we obtain that a^b^c^d^f. 
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6.8. Fourier transform 

The Fourier transform is a basic tool in Pure and Applied Mathematics, 
Physics and Engineering. Here we just mention a few basic facts, focus¬ 
sing on the use of this transform in Measure Theory and Probability. 

Definition 6.29 (Fourier transform of a function). Let / e L 1 (R, C). 

We set 

m ■= f f(x)e ~ ixf dx e R. 

Jr 

The function / is called Fourier transform of /. 

Since the map f m>- f(x)e~'^ x is continuous, and bounded by |/(x)|, 
the dominated convergence theorem gives that /(§) is continuous. The 
same upper bound also shows that / is bounded, and sup | /1 < ll/lli- 
More generally, the following result holds: 

Theorem 6.30. Let k e N be such that f R \x\ k \f\(x)dx < oo. Then 
f e C*(R, C) and 

D p f($) = (-i) p xPj{S) Vp = 0, .. ., k. 


The proof of Theorem 6.30 is a straightforward consequence of the 
differentiation theorem for integrals depending on a parameter (in this 
case, the £ variable, see the Appendix): 

Dt [ f(x)e~ lx ^ dx = f Dt (f(x)e~ ,x ^) dx 

■ Jr Jr 

= (— i) p f x p f(x)e~' x ^ dx. 

Jr 


According to the previous result, the Fourier transform allows to trans¬ 
form differentiations (in the § variable) into multiplications (in the x 
variable), thus allowing an algebraic solution of many linear differential 
equations. 

In the sequel we need an explicit expression of the Fourier transform 
of a Gaussian function. For a > 0, let 


e -\x\ 2 /(la 2 ) 

Pa(x) := (27 ra 2 y/2 


(6.36) 


be the rescaled Gaussian functions, already considered in Example 6.25. 
Then 

p a (x)e~^ x dx = e~^ a2/2 e R. (6.37) 

The proof of this identity is sketched in Exercise 6.27. 


/ 

Jr 
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Remark 6.31. (Discrete Fourier transform) I f / : M —> 1R is a 2T - 

periodic function, then we can write the Fourier series (corresponding, 
up to a linear change of variables, to those considered in Chapter 5 for 
2n -periodic functions) 

f = ^a n e in 7 x , in L 2 ((-7\ T)\ C), (6.38) 

ne Z 


with 




a„ = -— / f{x)e m T x dx, 


in-r It . . 7t 

e T = cosn—x + i smn—x. 


(6.39) 

Remark 6.32. (Inverse Fourier transform) For g e L 1 (M, C) we de¬ 
fine inverse Fourier transform of / the function 


■.-±[ 

Jm 


g(x) := ^ / g(i;)e ix t d$ 


x e 


It can be shown (see for instance Chapter VI. 1 in [7]) that the maps / m>- 
/ and g i—> g are each the inverse of the other in the so-called Schwarz 
space ,yfK, C) of smooth and rapidly decreasing functions at infinity: 


y*(R, c) 


{/ e C°°(M, C) 


lim \x\ k \D' f\(x) = 0 Wk, i e N 

\x\—>oo 


In particular we have 


f (x) = 2jt ( —) (x) = f a^e ,x ^ d% with 
\2tt J J R 

at : = — / f(x)e~'^ x dx. 

5 2tt J R 

These formulas can be viewed as the continuous counterpart of the dis¬ 
crete Fourier transform (6.38), (6.39). In this sense, at. are generalized 
Fourier coefficients, corresponding to the “frequency” £. The difference 
with Fourier series is that any frequency is allowed, not only the integer 
multiples nn/T of a given one. 

6.8.1. Fourier transform of a measure 

In this section we are concerned in particular with the concept of Fourier 
transform of a measure. 
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Definition 6.33 (Fourier transform of a measure). Let /z be a finite 
measure on M. We set 

/z(§) := f e~ ix $ dfi(x) e R. 

Jr 

The function // : M —> C is called Fourier transform of /z. 

Notice that Definition 6.29 is consistent with Definition 6.33, because 
fi — f whenever /z = /Jz? 1 . Notice also that, by the dominated con¬ 
vergence theorem, the function /z is continuous. Moreover /z(0) = /z(R) 
and, by estimating from above the modulus of the integral with the integ¬ 
ral of the modulus (see also Exercise 6.29), we obtain that |/z(§)l < /z(R) 
for all ( el. Still using the differentiation theorems under the integral 
sign, one can check that for k G N the following implications hold: 

[ \x\ k dptfx) < oo => /z G C k (R, C) and 
Jr _ (6.40) 

D'fKf ) = (-i)P x Pn(£) Vp = 0 


Let us see other basic examples of Fourier transforms of probability mea¬ 
sures: 

Example 6.34. (1) If /z = 8 X0 then fi(f) = e~ lx °f 

(2) If /z = p8 1 + q8 0 (with p + q = 1) is the Bernoulli measure with 
parameter p, then /z(£) = q + pe~‘f 

(3) If 

/* = (") / 2 V'“'<5; 

to 

is the binomial measure with parameters n, p then 

£($) = (q + pe“ !t )" e R. 


(4) 11 /z = c A l ai 0Ci) (v)..^' is the exponential measure, then 

1 


£(f) = 


1 + / £ 


g R. 


(5) If /z = (2a) is the uniform measure in [—a, a], then 

sin(a£) 


£(£) = 




el\{0). 


(6) If /z = [tt( 1 + x 2 )] 1 is the Cauchy measure, then (2) 

£($) = G R. 


This computation can be done using the residue theorem in complex analysis 
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Theorem 6.35. Any finite measure fi in M is uniquely determined by its 
Fourier transform fi. 


Proof. For a > 0 we denote by p a the rescaled Gaussian functions in 
(6.36). According to Exercise 6.27 we have 


~/2 


I 

J R 


= / p a (w)e lzw dw. 


Setting z = (x — y)/<x 2 , dividing both sides by (Ijra 2 )^ 2 we deduce that 

pAx -y)= „ \ . /2 f pAw)e- iw(x - y)l ° 2 dw. 

{2na 2 yl 2 J R 


Using Fubini-Tonelli theorem we obtain 


/ 

Jr 


Pa fix - y)dp(x) = 


-/ 
Jr 

f 

Jr 


1 


p a (w)e lw(x y)la ~dw^ dp(x) 


(! liter 2 ) 1 / 2 
Pa(w) 


(6.41) 


As a consequence, the integrals h a (y) = f R p a (y—x) dpt(x) are uniquely 
determined by fi. But, still using the Fubini-Tonelli theorem, one can 
check the identity 


f (f g(y)pa(x - y)dy) dp,(x) = [ h a (y)g(y)dy WgeC b (R). 
Jr \Jr J Jr 

(6.42) 

Passing to the limit as a l 0 and noticing that (by Example 6.25, that 
provides the weak convergence of p a X to <i 0 as a | 0, or a direct verific¬ 
ation) 


/ 

Jr 


-/ 

JR 


g(y)Pa(x — y)dy = / g(x - z)p a (z) dz 


g(x) 


V.x e 


from the dominated convergence theorem we obtain that all integrals 
f R g dp, for g e Cf, (M), are uniquely determined. Hence p is uniquely 
determined by its Fourier transform. □ 

Remark 6.36. It is also possible to show an explicit inversion formula 
for the Fourier transform. Indeed, (6.42) holds not only for continuous 
functions, but also for bounded Borel functions; choosing a < b that are 
not atoms of p and g = 1 (a h) , we have that j R g(x)p„(x — y) dy —* g(x) 
for /x-a.e. x (precisely for x {a, b}), so that (6.42) and (6.41) give 


f b C b C e~ u ' 1 ! 2a2 w 2 

p((a, b)) — lim / h a (v) dy = lim / / —- —fi(—)e iyw/a dwdy. 

or;o J a aio J a J R 2ita L a- 
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The change of variables w = to 2 and Fubini theorem give 


^ 4-0 Liz it 

for all points a < b that are not atoms of /z. 

According to Theorem 6.28 we have the implication: 



(6.43) 


/x/ 2 —> /z weakly =>• /!/, —>■ /z pointwise in M. (6.44) 


The following theorem, due to Levy, gives essentially the converse im¬ 
plication, allowing to deduce the weak convergence from the convergence 
of the Fourier transforms. 

Theorem 6.37 (Levy). Let (/Xh) be probability measures in M. If fh = 
jii, pointwise converge in M to some function f, and if f is continuous at 
0, then f = /z for some probability measure p. in M and /x* — > /z weakly. 

Proof. Let us show first that (pt h ) is tight. Fixed a > 0, taking into 
account that sin £ is an odd function and using the Fubini theorem we get 


o(%)d% = f j e ,x ^do(x)d%= f f cos (x^)d^dcr(x) 

J R J —a 



for any probability measure a. Hence, using the inequalities | sin i \ < |/1 
for all t and | sinr| < \t\/2 for |t| > 2, we get 



(6.45) 


For e > 0 we can find, by the continuity of / at 0, a > 0 such that 



By the dominated convergence theorem we get h 0 e N such that 


a 


(l — A/,(^)) d £ < sa V/z > Iiq. 


—a 


(6.46) 
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As a~ l ft (1 — /!/,(£)) —> 0 as a 0 for any fixed h, we infer that 

we can find b e (0, a] such that (6.46) holds with b replacing a for all 
h e N. From (6.45) we get ji h (K \ [— n, n\) < e for all h e N, as soon 
as n > 2/b. 

Being the sequence tight, we can extract a subsequence ( ii h (k )) weakly 
converging to a probability measure fi and deduce from (6.44) that / = 
fi. It remains to show that the whole sequence (///,) weakly converges to 
fi: if this is not the case there exist e > 0, g e C/,(M) and a subsequence 
h'{k) such that 



g diih'(k) 



> e 


Wk e N. 


But, possibly extracting one more subsequence, we can assume that !Xh'(k) 
weakly converge to a probability measure a ; in particular 



> s > 0. 


(6.47) 


As we are assuming that f h = ///, converge pointwise to / = /x we 
obtain that a = lim^ fih'(k) = /F hence fi = o. From Theorem 6.35 we 
obtain that fi = a, contradicting (6.47). □ 

Notice that just pointwise convergences of the Fourier transforms is 
not enough to conclude the weak convergence, unless we know that the 
limit function is continuous: let us consider, for instance, the rescaled 
Gaussian kernels used in the proof of Theorem 6.35 and let us consider 
the behaviour of the Gaussian measures /i a = () a .i£ A as a f oo; in this 
case, from Exercise 6.27 we infer that the Fourier transforms are point- 
wise converging in M to the discontinuous function equal to 1 at £ = 0 
and equal to 0 elsewhere. In this case we don’t have weak convergence of 
the measures: we have, instead, the so-called phenomenon of dispersion 
of the whole mass at infinity 

lim (M \ [— 77 , «]) = lim /xj (r \ [— —, —]) = q j (M \ {0}) = 1 

o-foo a^OO \ (T G / 

V/z G N 


and the family of measures is far from being tight as a f oo. 

Exercises 

6.27 Check the identity (6.37). 

6.28 ★ Show that {i is uniformly continuous in R for any finite measure /x. 






118 Luigi Ambrosio, Giuseppe Da Prato and Andrea Mennucci 


6.29 Let /x be a probability measure in M. Show that if |/t| attains its maximum 
at fo 7 ^ 0 , then there exist xq e M and c n e [ 0 , oo) such that 


l 1 — ^ ' Cn&x 


with 


x n — x 0 + 


2 rnt 

Ho 


Use this fact to show that // = 1 in M if and only if /x is a Dirac mass. 


Chapter 7 

The fundamental theorem of the integral 
calculus 


In this section we give a closer look at a classical theme, namely the fun¬ 
damental theorem of the integral calculus, looking for optimal conditions 
on / ensuring the validity of the formula 

fix) - f(y) = J f'{s)ds. 

Notice indeed that in the classical theory of the Riemann integration there 
is a gap between the conditions imposed to give a meaning to the integ¬ 
ral f* g(s ) ds (i.e. Riemann integrability of g) and those that ensure its 
differentiability as a function of x (for instance, typically one requires 
the continuity of g). We will see that this gap basically disappears in Le- 
besgue’s theory, and that there is a precise characterization of the class 
of functions representable as c + f x g(s)ds for a suitable (Lebesgue) 
integrable function g and for some constant c. 

The following definition is due to Vitali. 

Definition 7.1 (Absolutely continuous functions). Let I c K. be an in¬ 
terval. We say that / : I —> M is absolutely continuous if for any e > 0 
there exists 8 > 0 for which the implication 

n n 

^{bi -a t ) <8 =>- 1 fi b >) ~ fi a <)I < e i 1A ) 

1 = 1 1 = 1 

holds for any finite family {(a,-, ^i)}t<i<n of pairwise disjoint intervals 
contained in I. 

An absolutely continuous function is obviously uniformly continuous, 
but the converse is not true, see Example 7.7. 

Let /: [a, b\ —R be absolutely continuous. For any x € [a, b] define 

n 

F(x) = sup ^ |/(*/) - /(*/- 1 )|, 

W -t ; = 1 


L. Ambrosio et al., Introduction to Measure Theory and Integration 
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where is the set of all decompositions a — {a = x 0 < x\ < ■ ■ ■ < 
x n = x} of [a, x]. F is called the total variation of /. Let us check that 
F is finite: let <5 > 0 be satisfying the implication (7.1) with s = 1 and 
let us estimate from above a sum in the definition of F. Without loss of 
generality we can assume that |x,- — x,_i | < 8/2 for all i = l,... ,n — I, 
possibly adding more points (which increases the sum). Then, we can 
split the sum in families of intervals with total length larger than 8/2 and 
less than 8 (just keep adding a new interval to a family if the total length 
does not exceed 8 and notice that if it exceeds 8, the total length is at 
least 8/ 2); the number of these families is less than |(x — a) and, as a 
consequence, (7.1) gives 


F(x)< 


2 

8 


(x — a) + 1. 


We set 

/+(x) = l - (F(x) + f(x)), f~(x ) = (F (x) - f(x)), 

so that 

f(x) = f + (x)-f~(x), Fix) = f + (x) + f~(x), x e [a, b\. 

Lemma 7.2. Let f: [a, b] —> M be absolutely continuous and let F be 
its total variation. Then F, f + , f~ are nondecreasing and absolutely 
continuous. 

Proof. Let x e (a, b), y £ (x, b ] and cr = [a = x o < xi < • • • < x n — 
x}. Then we have 

n 

F(y) > \f(y) - fix )| + ^ | f( Xi ) - f ( Xi -{)|. 

i=i 

Taking the supremum over all a e T„ x , yields 

F(y) > | f(y) - fix )| + F(x), 

which implies that F, f + , f~ are nondecreasing. It remains to show 
that F is absolutely continuous. Let e > 0 and let 8 = 8(e) > 0 be such 
that the implication (7.1) holds for all finite families (a,-, b-,) of pairwise 
disjoint intervals with (b, — a,) < 8. Let now (a,, ly) be a family of 
disjoint intervals with (b, — a ,) < 8 and let us prove that \F(b[) — 


121 Introduction to Measure Theory and Integration 


F(cii )| < 2s. For any i = 1, ...,/? we can find cr,- = { a ,• = xqj < x\j < 
■ ■ ■ < x ni i = b [} such that 



1 < i < n. (7.2) 


Indeed, if a = yo < y\ < ■ ■ ■ < y mj — bj is a partition such that 



we can assume with no loss of generality (adding one more element to 
the partition if necessary) that yy. = a, for some k\ then, it suffices to 
estimate the first k terms of the above sum with F (ak), and to call x 0 j = 
jk, ..., x mj -k+\,i = y m , to obtain (7.2) with /;,■ = m, — k + 1. Adding 
the inequalities (7.2) and taking into account that the union of the disjoint 
intervals (xtj- i, Xkj) (for 1 < i < n,0 < k < n{) has length less than S, 
from the absolute continuity property of / we get 


n 


^( F(bi ) - F(ai)) < s + s = 2s. 


i=1 


This proves that F is absolutely continuous. 


□ 


The absolute continuity property characterizes integral functions, as 
the following theorem shows. 

Theorem 7.3. Le I = [a, b] C M. A function f : / —> M is represent¬ 
able as 



for some g G L 1 (/) if and only if f is absolutely continuous. 


Proof. (Sufficiency) If / is representable as in (7.3), we have 




Hence, setting A = U,-(a;, bj), the absolute continuity property follows 
by the implication 


\g\ds < e. 


2zf l (A) < 8 
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The existence, given <5 > 0, of e > 0 with this property is ensured by 
Exercise 6.14 (with /x = 1 and v = gj. z? 1 ). 

(Necessity) According to Lemma 7.2, we can write / as the difference 
of two nonincreasing absolutely continuous functions. Hence, we can 
assume with no loss of generality that / is nonincreasing, and possibly 
adding to / a constant we shall assume that f(a) = 0. We extend / to the 
whole of M setting / = 0 in (—oo, a) and / = f(b) in ( b , oo). It is clear 
that this extension, that we still denote by /, retains the monotonicity and 
absolute continuity properties. 

By Theorem 6.23 we obtain a unique finite measure v on (M, SS (M)) 
without atoms (because / is continuous) such that / is the repartition 
function of v. Since / is constant on (—oo, a ) and on ( b , +oo), we 
obtain that v is concentrated on /, so that 

f(x) = v ((—oo, x]) = v ((a, x]) Vx e M. (7.4) 

Now, if we were able to show that v <<C 1/Jz? 1 , by the Radon-Nikodym 
theorem we would find g e L l (I) such that v = gJz? 1 , so that (7.4) 
would give 

fix) = f g(s)ds Vx e I. 

J a 

Hence, it remains to show that v <$C IjjSf 1 . Taking into account the 
identity v((a, b)) = f(b) — f(a), the absolute continuity property can 
be rewritten as follows: for any s > 0 there exists <5 > 0 such that 

^ l (A)<8 => v(A) < e 

for any finite union of open intervals Ac/. But, by approximation, 
the same implication holds for all open sets, because any such set is the 
countable union of open intervals. By Proposition 1.24, ensuring an ap¬ 
proximation from above with open sets, the same implication holds for 
Borel sets B c I as well. This proves that v <C t/Jz? 1 and concludes the 
proof. □ 

We will need the following nice and elementary covering theorem. 

Theorem 7.4 (Vitali covering theorem). Let { B, (x/ )}, G/ be a finite 
family of balls in a metric space (X, d). Then there exists J C / such 
that the balls {/?,-, (x,)}; e y are pairwise disjoint, and 

[J B n (xj) c [J B^.fixi). 

iel ieJ 


(7.5) 
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Proof. We proceed as follows: first we pick a ball with largest radius, 
then we remove all balls that intersect the first chosen ball and choose 
a second ball of largest radius among the remaining ones. We continue 
removing all balls that intersect the second chosen ball and picking a 
third ball of largest radius among the remaining ones, and so on. The 
process stops when either there is no ball left, i.e. when the remaining 
balls intersect at least one of the already chosen balls. The family of 
chosen balls is disjoint by construction. If x e B n (jc,-) and the ball B n (jq ) 
has not been chosen, then there is a chosen ball B rj (xj ) intersecting it, so 
that d (xj ,xj) < ri + rj . Moreover, if B, (xj ) is the first chosen ball with 
this property, then rj > r, (otherwise, if r, > rj, either the ball B n (xfi or 
a ball with larger radius would have been chosen, instead of B rj (xj)), so 
that d(xi, xj) < 2f j . It follows that 


d(x, xj) < d(x, Xi) + d(xi, xj) < rj + 2rj < 3r j. 


As a: is arbitrary, this proves (7.5). 


□ 


It is natural to think that the function g in (7.3) is, as in the classical 
fundamental theorem of integral calculus, the derivative of /. This is 
true, but far from being trivial, and it follows by the following weak con¬ 
tinuity result (due to Lebesgue) of integrable functions. We state the 
result even in more then one variable, as the proof in this case does not 
require any extra difficulty. 

Theorem 7.5 (Continuity in mean). Let f e if (M"). Then, for 11 - 

a.e. x e M" we have 



The terminology “continuity in mean” can be explained as follows: it is 
easy to show that the integral means 



of a continuous function / converge to f(x) as r | 0 for any x e M", 
because they belong to the interval 


[min /, max /]. 

B r (x) B r {X) 


The previous theorem tells us that the same convergence occurs, for T£"~ 
a.e. x e M", for any integrable function /. This simply follows by the 
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inequality 


— f 

to„r n J B r (x) 


f(y)dy — f(x) 


a)„r" 


f f(y) — f(x)dy 

JB r (x) 


^ — 1 

to„r” J Br (x) 


I/O) - f(x)\dy. 


By the local nature of this statement, the same property holds for locally 
integrable functions. 

Proof of Theorem 7.5. Given e, 5 > 0 and an open ball B = B R ( 0), it 
suffices to check that the set 


A := I x e B : limsup- [ 

r;o to n r n Jb 


I/O) - f(x)\dy > 2s 


B r (x) J 

has Lebesgue measure less than (3" + 1)5. To this aim, we write / as the 
sum of a “good” part g and a “bad”, but small, part h, i.e. f = g + h 
with g : B —M bounded and continuous, and \\h\\ L i^ < s8, with 
B' = B r+ i (0); this decomposition is possible, because Proposition 3.16 
ensures the density of bounded continuous functions in L l (B). 

The continuity of g gives 


lim 

r 4-0 O), 


—— f IgO) - §(x)\dy = 0 V.r e B. 

'n r J B,-(x) 


Hence, as / = g + h, we have A c A u where 

| h(y) — h(x)\ dy > 2s | . 


A\ := \x e B : limsup- f 

,qo co„r n J B 


BrW 


Then, it suffices to show that S£"(A\) < (3" + 1)5. By the triangle 
inequality, we have also A\ C A 2 U A 3 with 


and 


A 2 := {x e B : \h(x)\ > e} 

\h(y)\dy > s \ 


A 3 :=\x e B : sup - [ 

re(0,l) <X> n r n J B 


B,(x) 


Markov inequality ensures that £^ n (A 2 ) < \\h\\ L \ (B) /s < 5, so that we 
need only to show that ,£" (A 3 ) < 3 n 5. 

Since x f B (v) |/r(y)| dy is continuous, we have that 


sup 

re(0,l) 


-f 

n r" J Br 




\h(y)\ dy 
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is lower semi continuous, hence A 3 is open. Notice also that for any 
x e A 3 there exists r e (0, 1), depending on x, such that 



\h(y)\dy > eco n r n . 


Let K c A 3 be a compact set and let { B (x ,-, r,)}, e/ be a hnite family of 
these balls whose union covers K. By applying Yitali’s covering theorem 
to this family of balls, we can find a disjoint subfamily {#,, (.r, )}, s y such 
that the union of the enlarged balls B 3ri . (x,) still covers K. Adding the 
previous inequalities with x = x,- and r = r, and summing in i e J, 
since all balls B n (x,-) are contained in B' we get 


JZ n (K) < T®„(3r ( y < -V f \h(y)\dy 

<- f \h(y)\dy<rs. 

£ J B' 

As K is arbitrary we obtain that «Sf"(A 3 ) < 3"<5. □ 

Since the continuity in mean is a local property, it is not difficult to 
extend the previous result to locally integrable functions. By applying 
this extended theorem to a characteristic function / = l E we get 


lim 

3? H 

( E n B r (x)) 

for Jzf"-a.e. x 



— I 

G E 



co n r n 




Cf n 

(E fl B r (x)) 

for ^f"-a.e. x 


lim 


- 0 

eM n \E 

ri 0 


co n r n 



for any E e -Sd (M"); points of the first type are called density points, 
whereas points of the second type are called rarefaction points. 

Using the continuity in mean of integrable functions we obtain the 
fundamental theorem of calculus within the (natural) class of absolutely 
continuous functions. 


Theorem 7.6. Let I <zM.be an interval and let f : I —> Mbe absolutely 
continuous. Then f is differentiable at Jz? l -a.e. point of I. In addition 
f is Lebesgue integrable in I and 


fix) = 



f(s)ds 


Vx e I. (7.6) 


Proof. Let g be as in (7.3), let x 0 e I be a point where 


1 

lim - 


j-xo+r 


|g(s) - g(x 0 )|r/5 = 0 


(7.7) 
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and notice that 

f(xo + r)-f(x 0 ) 1 f X0+r ... 

-= -/ g(s)ds 

r r Jx o 

1 rxn+r 

= g Oo) + - / g(i) - g(x o) ds 
r Jx o 

for r > 0. Hence, passing to the limit as r | 0, from (7.7) we get 
/|(jc 0 ) = g(xo); a similar argument shows that f'_(x o) = g(xo). As, 
according to the previous theorem, «£?'-a.e. point x 0 satisfies (7.7), we 
obtain that / is differentiable, with derivative equal to g, .if'-a.e. in /. 

It suffices to replace g with f in (7.3) to obtain (7.6). □ 

One might think that differentiability Jzf ^a.e. and integrability of the 
derivative are sufficient for the validity of (7.6) (these are the minimal 
requirements to give a meaning to the formula). However, this is not 
true, as the Heaviside function l (0oo) fulfils these conditions but fails to be 
(absolutely) continuous. Then, one might think that one should require 
also the continuity of / to have (7.6). It turns out that not even this 
is enough: we build in the next example the Cantor-Vitali function, also 
called devil’s staircase: a continuous function having derivative equal to 0 
Jz?'-a.e., but not constant. This example shows why a stronger condition, 
namely the absolute continuity, is needed. 

Example 7.7 (Cantor-Vitali function). Let 


X := {/ e C([0, 1]) : /(0) = 0, /(!) = !}. 


This is a closed subspace of the complete metric space C([0, 1]), hence 
X is complete as well. For any / : [0, 1] i—► M we set 


Tf(x) := 


/( 3jc)/2 

1/2 

1/2 + / (3x — 2)/2 


if 0 < 3x < 1, 
if 1 < 3x < 2, 
if 2 < 3x < 3. 


(7.8) 


It is easy to see that T maps X into X, and that T is a contraction (with 
Lipschitz constant equal to 1/2). Hence, by the contraction principle, 
there is a unique / e X such that Tf = f. 

Let us check that / has zero derivative ££ ^a.e. in [0, 1], As / = Tf, 
f is constant, and equal to 1/2, in (1/3, 2/3). Inserting this information 
again in the identity f = Tf we obtain that / is locally constant (equal 
to 1/4 and to 3/4) on (1/9, 2/9) U (7/9, 8/9). Continuing in this way, 
one finds that / is locally constant on the union of 2" -1 intervals, each 
of length 3~ n , n > 1. The complement C = [0, 1] \ A of the union A 
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of these intervals is Cantor’s middle third set (see also Exercise 1.8), and 
since 


jSf'(A) 




= 1 


we know that J£ l (C) = 0. At any point of A the derivative of / is 
obviously 0. 

In connection with the previous example, notice also that / maps A, 
a set of full Lebesgue measure in [0, 1], into the countable set {2 - "}„>i. 
On the other hand, it maps C, a Lebesgue negligible set, into [0, 1], a set 
with strictly positive Lebesgue measure. 


Exercises 

7.1 Let H : R —> R be satisfying the Lipschitz condition 

\H{x) - H(y)\ < C\x - y\ Vx, yeR 

and let / : [a, b\ —> R. be an absolutely continuous function. Show that Hof 
is absolutely continuous in [a, b]. 

7.2 ★ Let £ C K be a Borel set and assume that any ( e R is either a point 
of density or a point of rarefaction of E. Show that either 2z? l (E) = 0 or 
Jz? 1 (M \ E) =0. {Remark: the same result is true in R", but with a much harder 
proof, see [3], 4.5.11). 

7.3[Lipschitz change of variables] ★ Let / : I = [a, b] ->■ R be absolutely 
continuous (resp. Lipschitz). Show that 

rf(b) rb 

/ <p(y)dy= / <p(f(x))f'(x)dx 
J f(a) Ja 

for any bounded (resp. integrable) Borel function <p : /(/) —> R. 

7.4 Use the previous exercise to show that, for any Lipschitz function / : R —»■ 
R and any ££'-negligible set N e ^(R), the derivative f vanishes .Sf^-a.e. 
on 



Chapter 8 

Measurable transformations 


In this chapter we study the classical problem of the change of variables 
in the integral from a new viewpoint. We will compute how the Lebesgue 
measure in K" changes under a sufficiently regular transformation, gen¬ 
eralizing what we have already seen for linear, or affine, maps. As a 
byproduct we obtain a quite general change of variables formula for in¬ 
tegrals with respect to the Lebesgue measure. 

8.1. Image measure 

We are given two measurable spaces (X, S) and (Y, SP), a measure n 
on ( X , S’) and a (S\ J^)-measurable mapping F: X —> Y. We define a 
measure F#pi in (7, J^") by setting 

F#fi(I) ■— fi(F~ l (I)), IeSP. (8.1) 

It is easy to see that F#/jl is well defined, by the measurability assumption 
on F, and a -additive on ■'F. F#fi is called the image measure of n by F. 

The following change of variable formula is simple, but of a basic 
importance. 

Proposition 8.1. Let(p:Y —> [0, oo] be a measurable function. Then 

we have 

f <p(F(x))d[i(x) - [ <p(y)dF#fi(y). (8.2) 

Jx Jy 

Proof. By monotone approximation it is enough to prove (8.2) when ip 
is a simple function. By linearity of both sides we need only to consider 
functions <p of the form ip = 1 7 , where I e ■'F. In this case we have 
<p o F — hence (8.2) reduces to (8.1). P 

In the following example we discuss the relation between the change of 
variables formula (8.2), that even on the real line involves no derivative, 
and the classical one. The difference is due to the fact that in (8.2) we are 
not using the density of T#/z with respect to .if 1 . It is precisely in this 
density that the derivative of F shows up. 
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Example 8.2. Let F : M —> M be of class C 1 and such that F'{t) > 0 for 
all ( el. Let A be the image of F (an open interval, by the assumptions 
made on F) and let fr : A —>■ M be continuous. Then for any interval 
\a , b] C A the following elementary formula of change of variables holds 
(just put y = Fix) in the right integral): 


/ 

Jf-' 


F~ l (b) 


V r(F{x))dx= f if(y) -t- dy. 

(a) Ja F'(F-'(y)) 


1 


On the other hand, choosing cp = frtj with I = [a , b ] in (8.2), we have 

r-F~ l (b) nb 

/ f/(F(x))dx = / if(y)dF # £? 1 . 
df-l(a) Ja 


Hence, comparing the two expressions, we find 

nb 1 rb 


[ fiy) -t- dy — [ ir{y) dF#J£ l . (8.3) 

Ja F'(F~Hy)) - Ja * - 


Since a, b and \f/ are arbitrary, (8.3) can be interpreted by saying that 

F#F £ 1 < J^ 1 and 


In the next section, we shall generalize this formula to K", and even in one 
space dimension we will see that the assumption that F' > 0 everywhere 
can be weakened (see also Exercise 8.3). 


8.2. Change of variables in multiple integrals 

We consider here the measure space (M", SS (K"), where .if" is the 
Lebesgue measure. 

We recall a few basic facts from calculus with several variables: given 
an open set U C M" and a mapping F : U M", F is said to be differ¬ 
entiable at x e U if there exists a linear operator DF{x ) e L(M"; M") (1) 
such that 

\F(x + h)-F(x)-DF(x)h\ 

hm -= 0. 

W^o \h | 

The operator DF(x) if exists is unique, and is called the differential of F 
at x. If F is affine, i.e. F(x) = Tx + a for some T e L(M"; M") and 
a e M", we have DF(x ) = T for all x £ U. 


(1 * L (R"; R m ) is the Banach space of all linear mappings T : R" —>• R m endowed with the sup norm 
|| 3" || = suptlTxl : x e R", \x\ = 1} 
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If F is differentiable at x £ U we define the Jacobian determinant J F (x ) 
of F at x by setting 

J F (x) = det DF(x). 

If F is differentiable at any x e U and if the mapping DF .U —> 
L{ M"; M") is continuous, we say that F is of class C l . If, in addition, 
F is bijective between U and an open domain A and F~ l is of class C 1 
in A, we say that F is a C 1 diffeomorphism of U onto A. In this case we 
have that DF(x) is invertible and 

D(F~ l )(F (x)) = ( DF(x))~ l V.r e U. 

Finally, by Proposition 6.10 we know that if T £ L(M"; M") we have 

& n {T{E)) = |detr| f£ n {E) WE e 38(W 1 ). (8.4) 

8.3. Image measure of Jz?” by a C 1 diffeomorphism 

In this section we study how the Lebesgue measure changes under the 
action of a C 1 map F. The relevant quantity will be the function \J F |, 
which really corresponds to the distorsion factor of the measure. 

Let U C R" be open. The critical set CY of F £ C l (U ; R") is defined 
by 

C F :={x eU : J F {x) = 0}. 

Lemma 8.3. The image F(C f ) of the critical set is Lebesgue negligible. 

Proof. Let K c C F be a compact set and e > 0; for any x £ K the set 
DF(x)(B i(0)) is Lebesgue negligible (because DF is singular at x, so 
that DF(x)( M") is contained in a (n — 1)-dimensional subspace of M"), 
hence we can find <$ = <$(£,v)>0 such that 

JSf" ({z £ R" : dist (z - F(x), DF(x)(B\(0))) < 5}) < e. 

By a scaling argument we get 

JSf" ({z £ M" : dist (z - F(x), DF{x){B r { 0))) < 5r}) < er n Vr > 0. 

On the other hand, since |F(y) — F(x ) — DF{x)(y — x)\ < Sr in B r (x), 
provided r is small enough, we get 

F(B r (x )) c{ze M" : dist (z - F(x), DF(x)(B r ( 0)) < 5r} . 

It follows that B r (x) C U and ££"(F(B r (x))) < sr n for r >0 small 
enough, depending on x. 
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Since the family of balls {B r p(x)} xe K covers the compact set K, we can 
find a finite family {fi/-,/ 3 (-r,)}, e / whose union still covers K and extract 
from it, thanks to Vitali’s covering theorem, a subfamily { B n p(xi)}i € j 
made by pairwise disjoint balls such that the union of the enlarged balls 
{B r .(xi)}i e j covers K. In particular, covering F(K) by the union of 
F(B n (xi )) for i e J, we get 


JS?"( F(K )) < E er 'i 

ieJ 



3 n e 

- Ff'fU). 

OJ n 


Letting £ j, 0 we obtain that .2 9 ' 1 (F(K)) = 0. Since K is arbitrary, 
by approximation (recall that C F , being a closed subset of U, can be 
written as the countable union of compact subsets of U) we obtain that 
j¥ n (F(C F )) = 0. □ 


The following theorem provides a necessary and sufficient condition 
for the absolute continuity of F#Jt? " with respect to F£ n , assuming a C 1 
regularity of F. 

Theorem 8.4. Let U c W 1 be an open set and let F : U —> M" be of 
class C 1 , whose restriction to U \ C F is injective. Then: 

(i) Fu(ltj. ( / Jn ) is absolutely continuous with respect to Jz?" if and only if 
Cf is Lebesgue negligible. 

(ii) If F#(l u JZ’ n ) F£ n we have 

F#{^u^ n ) = \j F ^ F _^t F( u\c F )-2”'- (8-5) 

Proof, (i) If S? n (C F ) > 0, we have F # (l f/ ^f' , )(F(C f )) > ££ n {C F ) > 
0 and F//(t (/ .5f" ) fails to be absolutely continuous with respect to , f £", 
because we proved in Lemma 8.3 that F(C f ) is Lebesgue negligible. 

Let G be the inverse of the restriction of F to the open set U \ C F . The 
local invertibility theorem ensures that the domain A = F(U\C f ) of G is 
an open set, that G is of class C 1 in A and that DG(y ) = ( DF)~ l (G(y )) 
for all y e A. Let us assume now that C F is Lebesgue negligible and 
let us show that F~ l (E) is Lebesgue negligible whenever E C F(U) is 
Lebesgue negligible. Since we already know that C F is "-negligible 
set, we can assume with no loss of generality that E C A and show that 
G(E ) is Lebesgue negligible. Let A M be the open sets 


A M :={ye A: ||DG(y)|| < M}. 


We will prove that 


J ¥’\G{K)) < (3M)"Jzf' ! (F) 


(8.6) 
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for any compact set K C A M . So, F#J£ n < (3on the compact 
sets of A m and therefore on the Borel sets; in particular 

J T\G{E n A m )) < (3 M) n Ef n (E n A M ) = 0, 

and letting M f oowe obtain that J£ n {G(E)) = 0, because E c A. 

In order to show (8.6) we consider a bounded open set B contained in 
A m and containing K, and the family of balls B r (y) C B with y e K 
and r > 0. For any of these balls the mean value theorem gives (with 
t = t(y, z) e (0,1)) 

|G(z)-G()0| = \DG((l-t)y+tz)(z-y)\ < M\z-y\ Vz e B r (y), 


therefore G(B r (y)) c /I v/,-(G(y)) for any of these balls. Since the family 
of balls {B r p(y)} ye p covers K, we can find a hnite family {fi r ,/ 3 (y/)}ie/ 
whose union still covers K and extract from it, thanks to Yitali’s covering 
theorem, a subfamily 73 (y; )},<=./ made by pairwise disjoint balls such 
that the union of the enlarged balls { B n (y, )} ;<E j covers K. In particular, 
by our choice of the radii of the balls, the family {B Mn (G(v,))}/ e y covers 
G(K). We have then 

E£ n {G(K)) < Y,<°nWri) n = (^)" < (3 M) n ^'\B). 

ieJ ieJ ^ 


Letting B \, K we obtain (8.6). 

Let us prove (ii). We denote by h the Radon-Nikodym derivative of 
/^(Ij/.Sf' 1 ) with respect to Jzf"; by Theorem 7.5 we have that 


1 f 

h( y) — lim- / 

/•to co„r n J B 


h(z) dz = lim 
B r OO GO 




co„r n 

for "-a.e. y e A. 


Taking into account that F#(l {/ «Sf") is concentrated on A, and that 
\/\Jf I o E~ l = | J (j |, it remains to prove that for all y 0 e A we have 


lim 

ri 0 


3? n (G(B r (y 0 ))) 
co n r n 


= l^cl(yo)- 


(8.7) 


For the sake of simplicity we only consider the case when y 0 = 0 and 
G(0) = 0 (this is not restrictive, up to a translation in the domain and in 
the codomain). We divide the rest of the proof in two steps. 

Step 1. We assume in addition that DG(0) = I and show that 

«Sf"(G(B r (0))) 

lim-= 1, 

GO co n r" 


which is equivalent to (8.7) in this case. 


( 8 . 8 ) 
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Since DF{ 0) = DG{ 0) — I we have by the definition of derivative, 


,. \F(x)-x\ 

lim -= 0, 

lyl->o 1*1 


lim w-y' = 0 


|y|->o 


M 


So, for any s e (0, 1) there exists S £ > 0 such that if |x| < 8 e we have 
x e U\Cp and |F(x)— x\ < e\x\ and if |y| < 8 e we have v G F(U\Cp ) 
and | G(y) — y| < e \ y \ . It follows that r < 8 e implies 

|F(x)| < r Vie fl (1 _ e)r ( 0), |G(y)| < (1 + e)r Vy e B r ( 0). 

(8.9) 


In particular 

fl(i-«)r(0) C G(B, (0)) c B a+S)r ( 0) Vr < S e . (8.10) 

Now, by (8.10) it follows that 


(i - s) n < 


& n (G(B r ( 0))) 
co n r" 


< (1 +£)", 


provided r < 8 , : , and this proves that (8.8) holds. 

Step 2. Set T = DG{ 0) and H(x) = T~ l G(x), so that DH( 0) = /. 
Then we have G(B, ( 0)) = T(H(B r (0))) and so, thanks to (8.4), 


-Sf"(G(fi r (0))) = J? n (T(H(B r (0m = |detr| JZ n (H{B r ( 0))), 


which implies 


Jz?"(G(.B,. (0))) & n (H(B r ( 0))) 

lim- = | det T| lim-= | detT’l 

NO a> n r n rio co„r n 


The proof is complete. □ 

Example 8.5 (Polar and spherical coordinates). Let us consider the 
polar coordinates 

(p, 9) i —> (p cos 9, p sin0). 

Here U = (0, oo) x (0, 2tt) and the critical set is empty, because the 
modulus of the Jacobian determinant is p. 

In the case of the spherical coordinates 


(p,9,<p) i—► (p cos 9 sin0, p sin# sin0, pcos^) 


we have U = (0, oo) x (0, 2tt) x (0, n) and the critical set is empty, 
because the modulus of the Jacobian determinant is —p 2 sin </>. 
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Theorem 8.6 (Change of variables formula). Let U C W 1 be an open 
set and let F : U M" of class C 1 , injective on U \Cp. Then 



<p(y) dy = 


L 


<p(F(x))\J f\(x) dx 


( 8 . 11 ) 


for any Borel function <p: F{JJ) —r [0, +oo]. 

Proof. We first see that it is not restrictive to assume that Cf = 0; indeed, 
F(Cf) is Lebegue negligible and so images of points in CY do not affect 
the left hand side, while obviously points in Cf do not affect the right 
hand side. So, possibly replacing U with U \ Cp, we can assume that 
Cf — 0 - 

By (8.2) and (8.5) we have 

f - t&l - dy= f f(F(x))dx. 

Jf ( u ) I/fIOF-'OO) Ju 


for any nonnegative Borel function f. We conclude choosing f(y) = 
(p(y)\J F \(F~ l (y)). □ 


Exercises 

8.1 Let ( X , , r F), ( Y , &) and (Z, Jif) be measurable spaces and let / : X — Y, 
g : Y —>• Z be measurable maps. Show that 

g#(f#d) = (g° f)#d 

for any measure // in ( X , ,F). 

8.2 Let / : {0, 1} N —»■ [0, 1] be the map associating to a sequence (a,) C {0, 1} 

the real number a,2~‘~ l e [0, 1]. Show that 

/# ^ X (-So = Vi]-^ 1 - 

8.3 ★ Show the existence of a strictly increasing and C 1 function F : R. —► K 
such that F#FF 1 is not absolutely continuous with respect to Jzf 1 . 

8.4 ★ ★ Remove the injectivity assumption in Theorem 8.4, showing that 

F#(l ^ n) = E 

xeF-'(r)\C f 1 lw 


for any C 1 function F : U —*■ R" with Lebesgue negligible critical set. 




Appendix A 


A.l. Continuity and differentiability of functions depending 
on a parameter 

In this section we consider the following problem: we are given a metric 
space (X, d) and a measure space (Y, ^,/x). Given / : X x Y —>■ M, we 
assume that for all x e X the function f(x, •) is /x-integrable, so that the 
function F : X —> M given by 

F (x) ■= J f(x,y)d[i(y) xeX 

is well defined. We would like to understand under which conditions F, 
an integral depending on the parameter x, is continuous. When X is an 
open subset of M" endowed with the Euclidean distance, it is also natural 
to investigate the differentiability properties of F. 

Theorem A.l (Continuity of F). Assume that f(-,y ) is continuous in 
X for /x-almost all y e Y and that there exists m e L 1 (Y, fx) satisfying 

sup \f(x, y) | < m(y) for / x-a.e . y eY. (A.l) 

X€X 

Then F is bounded and continuous in X. 

Proof. It is clear that |Y(x)| < ||m||i for all x e X. Continuity is a 
simple consequence of the dominated convergence theorem: indeed, if 
x n e X converge to x, then / (x „, y) converge to f(x, y) for //-almost 
every y and the convergence is dominated because of (A.l). It follows 
that F(x n ) F(x). □ 

A more expressive way to state the continuity of F is to say that limit 
and integral commute, namely 

lim / / (x h , y) duty) = / lim f(x h ,y)dfx(y). 

h— YX) / Y J y /z-A-OO 
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The following example shows that if no uniform upper bound is imposed 
on /, then continuity might fail: 

Example A.2. Let X = Y — M, /x = Jz? 1 and 


/(*, y ) 


1*1(1-MW) ifMl*| < i; 

0 ifMI*l>l. 


Then F(x) = 1 for all r /0, while F (0) = 0. In this case the smallest 
possible function satisfying (A.l) is |y| -1 which is not integrable. 

Next, we assume that X is an open set of M" endowed with the Euc¬ 
lidean distance and we investigate the differentiability of F. Under suit¬ 
able assumption, we can commute derivative and integral, namely 

3 [ f df 

— / f(x,y)dn(y)= / -—(*, y)dniy) Vx e X, i = 1,..., n. 

OXi Jy JY ° x i 

(A.2) 

Theorem A.3 (Differentiability of F). Assume that for /i-almost all y e 
Y the function /'(•, y) is differentiable in X with a continuous gradient 
V v /(x, v) and that, for any ball B, (xq) C X, there exists m e L l (Y, /x) 
satisfying 


\f(xo, y)| + sup \V x f\(x, y) < m(y) for \i-a.e. y eY. (A.3) 

xeB r (x 0 ) 


Then F e C l {X) and (A.2) holds. 


Proof. We fix xo e X, i e {1. n] and x, = x + t,e,- with ?, 0 and 

ti —> 0. The mean value theorem, applied for any y such that /(•, y) e 
C ! (A), gives 9i(y ) e (0, 1) satisfying 


Fjx o + t/e f ) - F(x q) 
U 


L 


— (x 0 + 0i(y)tiei, y) dfi(y) 

OXi 


For i large enough (as soon as \tj\ < r) the functions of y inside the 
integral are dominated by the function m in (A.3), hence we can pass to 
the limit with the dominated convergence theorem to get (notice that the 
measurability of 3//3x, (xq, •) follows by the same limiting process) 


3 F 

dxj 


Oo) = 


f x~i x o^y) d diy)- 

Jy 9x; 


Finally, continuity of partial derivatives of F is a consequence of the 
previous theorem. □ 
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Of course similar statements can be given for k- th order derivatives of F, 
provided /(•, y ) is k times differentiable and, for any ball B r (x o) C R" 
there exists m e LHY, p) satisfying 

l/Oo, y)\ + sup sup \D p f\(x, y ) < m(y ) for /x-a.e. y £ F 

xeB r (x 0 ) ip|<t- 

(here p = (p 1 , ■.., p«) and |/;| = pi + • • ■ + /?„)■ Under this assumption 
one obtains that 

J fix, y) dp{y) = J D p f(x, y) dp{y) whenever \p\ < k. 

A.2. The dual space of continuous functions 

In this section we want to characterize the space (C(X))*, dual space of 
C(X), with (X, d) compact metric space. Recall that C(X) is a Banach 
space, when endowed with the sup norm, regardless of any assumption 
on (X, d). Some knowledge of the basic terminology of Banach spaces 
(dual space, dual norm) is needed for this section. 

We start with some notation: we shall denote by (X) the space of 
signed measures p, i.e. the real-valued and o -additive set functions p, 
defined on 3§{X), of the form p = /x + — with positive and finite 
Borel measures satisfying p + _L p~. 

This orthogonality condition ensures uniqueness of the decomposition 
of p, as we will see in a moment; existence, instead, is just a consequence 
of the a -additivity (see Section 6.5), but we shall not use this fact in the 
sequel. 

For p e .#(X) we denote \p\ = p + + p~ its total variation measure, 
as in Section 6.5, and set 

\\p\\:=\p\(X) = p + (X) + p~(X). (A.4) 

In the next proposition we show that the decomposition p = p + — p~ 
is unique, so that (A.4) is well posed, and that ../0, (X) is a normed space. 
The completeness of .JY (X) will be a consequence of Theorem A.6, since 
any dual space is complete. 

Proposition A.4. For any p e ■ -// (X) the decomposition p = p + — p~ 
is unique. In addition .M(X), endowed with the norm (A.4), is a normed 
space. 

Proof. Assume that p = p + — p~ = p + — p~ , with orthogonal decom¬ 
positions. Let A be a Borel set where p + is concentrated, so that p~ is 
concentrated on X \ A, and let A be an analogous Borel set for p ± . Since 
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/x > 0 (respectively /x < 0) on the subsets of A (respectively of A \ A) 
and the same property holds for A, we obtain that /x (and therefore /x ± 
and /xA) vanishes on subsets of A \ A and of A \ A. On the other hand, if 
fi C Afl Awe have p~(B) = jl~(B) = 0 and 

/x+(fi) = ( 1 (B) = a + {B). 


Analogously, if B C (A \ A) PI (A \ A) we have pt + (B) = jl + (B) = 0 
and p~(B ) = ji~(B). This proves that /x ± = /x ± . 

Now, stability of -M (A) under multiplication with real constants and 
1-homogeneity of the norm are obvious. Let us prove stability under 
addition and subadditivity of the norm: if /x = /x + — /x“ and v = v + — v~ 
we can write as before /x = f\p\ and v — g|v| with f,g:X—>[— 1,1]. 
Then, setting a = |/x| + |v|, the Radon-Nikodym theorem gives |/x| = aa 
and |v| — ba for suitable a, b : X —> [0, 1], so that 

V + v = f\(i\ + g\\v\ = (fa + gb)o 

and we may take (fa + gb^a as positive and negative parts of /x. + v. 
We obtain also 

IIm + v||= [ \fa + gb\do < f \a\ + \b\dcr = ||pt|| + ||v||. 

Jx Jx 

This completes the proof of the proposition. □ 

We shall also denote by s/(X) the collection of open subsets of X 
and use the following characterization of set functions defined on .o/(X) 
which are restrictions of o -additive measures defined on the Borel o- 
algebra. 

Proposition A.5. Let (A, d) be a compact metric space and let a : 
.s/( A) —> [0, +oo] be a nondecreasing set function satisfying a(0) = 0 
and: 

(i) (continuity) if A n £ x/(A), n £ N, monotonically converge from 
below’ to A, then a(A n ) f a(A); 

(ii) ( subadditivity) a(A\ U Af) < u(A]) + a(A 2 ) for all A\, A 2 £ 
*f(X); 

(iii) (additivity on disjoint sets) a(Ai U A 2 ) = a(A\ ) + a(A 2 ) whenever 
Ai £ g/(X) and A 2 £ ,<//(X) are disjoint. 


Then 

a(B ) := inf [a(A) : A £ s/(X), A D B} (A.5) 

is a o -additive extension of a to SX (A). 
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Proof. Notice first that a is a -subadditive on <J(Xy. indeed, if A C U,- A,- 
and B is an open set with compact closure in A, then B is contained in 
the union of finitely many A, ’s, so that (ii) gives 

OO 

a(B) < y^ajAj). 

i =1 

Since B is arbitrary, (i) gives a (A) < a (A,). 

Now, if we take (A.5) as the definition of a for all subsets of X, Pro¬ 
position 1.16 gives that a extends a and is o -subadditive. Then, The¬ 
orem 1.17 gives that a is o -additive on the Borel cr-algebra, provided 
we are able to show that any Borel set is d-additive. Since the class of 
additive sets is a a-algebra, suffices to show that any closed set is a- 
additive. 

To this aim, we first show that a is additive on distant sets, namely 
(recall that dist(C/, V) is the infimum of the distances d(x, y) for x e U 
and y e V) 

a(Bi U B 2 ) = oc{B\) + a(B 2 ) whenever dist(Bi, B 2 ) > 0. (A.6) 
Indeed, if A D Si U B 2 is open we can consider the disjoint open sets 

A] := {x € A : dist(x, B\) < dist(x, B 2 )}, 

A 2 := {x e A : dist(x, B 2 ) < dist(x, B 2 )} 

containing B\ and B 2 respectively to get 

or (A) > a(A { U A 2 ) = a(Ai) + a(A 2 ) > a(B|) + a(B 2 ). 

Since A is arbitrary the inequality > in (A.6) follows, while the converse 
one is a consequence of subadditivity. 

Let F c A be closed, B c X and let us prove that a(B n F) + a(B \ 
F) < a(B) (the opposite inequality follows by subadditivity). Assuming 
with no loss of generality a(B) < oo and setting 

B h := {x e B : 2 h > dist(x, F) > 2 /l_l } he Z 

the additivity on distant sets gives 

^2a(B 2h ) < a(B) < oo, 

/zeZ 


^a(B 2 /,+ 1 ) < a(B) < oo 
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because all finite sums are made on distant sets, all contained in B. We 
have then that sZ a(B/,) is convergent and, since the sets B h are a 
partition of B \ F, using once more the additivity on distant sets we get 


( N \ oo 

LJ a (5/,) 

h =—oo ' /i=A+l 

( N \ oo 

(B n F) U [J B h ) + ^ a(Bft) 

h=—o o ' 1 

oo 

< a(B) + ^ d(5/,) 

A=JV+1 


for any IV > 1. Letting N —> oo the inequality follows. 
For g e C (X) we can define 


/ gd/i := / gd/z + - 
Jx dx Jx 


gj/x . 


In this way f gd/i is linear w.r.t. g; in addition, since 


f^-r 

Jx Jo 

= f 


/Jt + {{h > f}) dt 


/i({h > t}) dt 




/z ({h > t}) dt 


□ 


whenever h is nonnegative, splitting g in positive and negative part we 
obtain that f x g d\x is also linear w.r.t. to /z. Since 



< f |g| <i/z + + [ \g\dn < max|g|||/z|| = ||g||||/x|| 
Jx Jx 


Vg e C(X) 


the functional 

Mg):= [ gdd geC(X) (A.7) 

Jx 

belongs to (C(X))* and satishes || | < ||/z||. The remarkable fact 

is that any element in the dual is representable in this form, and that 
equality holds. This will also prove that ../0, (A) is a Banach space (with 
the definition of - X/ (X) given above, independent of Section 6.5, it is not 
even totally obvious that it is a linear space!). 
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Theorem A.6 (Riesz). Let ( X , d) be a compact metric space. The space 
(C(X))* is, via (A.7), isomorphic and isometric to ./f/(K). That is: all 
functionals belong to (C(X))* and, for any L e (C( X ))*, there exists 
a unique p G satisfying L = L fJL . Finally, UL^J = \\p\\. 

Proof. The proof will be achieved in three steps. In the first one we build 
an auxiliary positive finite measure p* and prove in the second one that 
p* provides the desired representation of L when L is nondecreasing. 
In the last one we achieve the general case and provide equality of the 
norms. 

Step 1. Let a* : g/(X) [0, +oo) be defined by 

a*(A) := sup{|L(g)| : |g| < 1, suppgcA}. 

Notice that a*(X) < || L | and that a*(0) = 0. Notice also that we can 
equivalently replace |L(g)| with L(g) inside the supremum and that a 
simple approximation argument gives 

a*(A) > |L(g)| whenever|g| < l A . (A.8) 

Indeed, if |g| < l A we can find continuous functions g n : X —> [— 1, 1] 
convergent to g and with support contained in A. In addition, if L is 
monotone we have also 

a*(A) < L(x) whenever 1 A < x■ (A.9) 

We claim that a* satisfies all the assumption of Proposition A.5. Indeed, 
if g e C (A) has support contained in A, since the support is compact 
we have K C A, for i large enough; it follows that L(g) < a*(A,) < 
sup / a*(Aj) and since g is arbitrary the continuity follows. In order to 
prove the subadditivity, given a continuous g : X —> [—1, 1] with support 
K contained in A\ fl Ai, we can consider the disjoint compact sets K \ 
A] and K \ A 2 and a continuous function / : X —► [0, 1] identically 
equal to 1 in a neighbourhood of K \ Ai and identically equal to 0 in a 
neighbourhood of K\ At. It follows that (1 — x)g has support contained 
in Ai and yg has support contained in A%, hence 

L(g) = L(( 1 - X)g) + L(xg) < a*(Ai) + a*(A 2 ). 

Since g is arbitrary, the subadditivity of a* follows. Finally, to prove the 
additivity on disjoint sets it suffices to notice that, given g, with support 
in Aj and |g,j < 1, the function g = gi + g: has support in A\ U A 2 and 
satisfies L(g) = L(gO + L(g 2 ) and \g\ < 1. 
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By Proposition A.5 we obtain that a* is the restriction to -<J(X) of a 
positive measure /x*. Notice also that /x* is finite, since 

fi*(X) = a*(X)=\\L\\. (A. 10) 


Step 2. Now we claim that L > |L|, namely L^*(g) > |L(g)| for any 
nonnegative g e C(X). Also, we shall prove that if L is nondecreasing, 
namely L(g) > 0 whenever g e C(A) is nonnegative, then L^* coincides 
with L. This proves already Riesz theorem for positive functionals. 

By homogeneity, in the proof of the inequality L^*{g) > |L(g)|, it is 
not restrictive to assume 0 < 8 < I- Given an integer N > 1, let us 
consider the open sets A, := {g > i/N}, i = 0, ..., N — 1, and notice 
that 



Now, given continuous functions x; : X > [0, 1] satisfying 1 a ; < Xi < 
1^._ , i = 1,..., N, we can use (A.8) to estimate 



we can let N —► oo and use the continuity of L to get L^*{g) > |L(g)|. 
If L is also monotone we can use the inequality (A.9) to get 





Again we can let A —>• oo and use (A. 12) to get L jjL *(g) = L(g). 

Step 3. Now we define linear continuous functionals : C(X) —> M 
by 


L + (g ) 


+ L(g) 


L~(g ) 


V(g)-^(g) 

2 


We have L + + L~ = and L + — L~ = L. In addition, by Step 2, L ± 
are monotone. 

Now we can apply the construction of Step 1 and use monotonicity in 
Step 2 to find positive finite measures /x ± such that L ± = L^. It follows 
that 

L = L L = L^+ = Lp 
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and the representation of L follows. Analogously, we obtain that 

L/_i* — L + L = L^+ + L = L M + +/i - 

so that /A* = /x + + /x _ . To conclude, we identify ||/x|| with ||L|| and show 
that /x + and /x“ are orthogonal. The bound on ||/x|| follows by (A. 10): 


IIMII = d + (X) + fi~(X) = L m+ (1) + L„-( 1) = V(l) = ll^ll- 

In order to show that /x + ± /x“, write /r ± = a^/i* and use the identity 
/x* = /x + + /x“ to get a + + a~ = 1 /x*-a.e. in A. On the other hand 
the density of C(X) in L 2 (X, n*) and a truncation argument provide 
a sequence of continuous functions g„ : X —> [—1,1] convergent in 
L 2 (X, /a*) to the sign of a + — a~, so that 


||L|| = sup |L M (g)| = sup 
l«l<i l«l<i 


J ( a + — a )gd/A* 



a | clfA*. 


Hence 

f (1 - \a + -a~\) dfA* = ia*(X) - ||L|| <0. 

Jx 

Since \a + — a~\ < 1 it must be \a + — a~\ = 1 /z*-a.e. in X. Since 
a ± e [0, 1] ii- a.e., this can only happen if a + a~ = 0 /x*-a.e. in X, 
which means that /x + is orthogonal to /x“. □ 

Remark A.7. A similar result holds, with minor changes in the proof, 
if (X, d) is locally compact and separable, namely there exists an non¬ 
decreasing sequence of open sets with compact closure whose union is 
the whole of X. In this case C(X) has to be replaced by Co (A), namely 
the closure in C(A) of the space C c (A) of compactly supported func¬ 
tions, while jjt(X') remains unchanged. 




Solutions of some exercises 


In this chapter we provide solutions to the main exercises proposed in the 
text, and in particular of those marked with one or two ★. 

Chapter 1 

Exercise 1.1. All verifications are very simple and we omit them. 

Exercise 1.2. We prove the statement for the translations, the proof for 
the dilations being similar. Fix h e R and consider the class 

: A + /ie J(I)}. 

Then & is a a -algebra containing the intervals, because the class J" of 
intervals is invariant under translations. Therefore & D cr FA) = SS (R). 
This proves that A + h is Borel whenever A is Borel. 

Exercise 1.3. Set X = N and /z := S n . Then the sets A n := \n, n + 
1, ...} satisfy /x(A„) = +oo, but their intersection is empty. 

Exercise 1.4. Let A n \ A with A n , A e ■«/. Then the sets B n := A \ A n 
satisfy B n 0, so that by assumption /x(f? ;i ) l /z(0) = 0. Since /z is 
finite, ii(B n ) = /z(A) — /r(A„), so that /z(A„) t /x(A). 

Exercise 1.5. For any n e N* the set A n of all atoms x such that n({x}) > 
l/n has at most cardinality n/i(X): indeed, if we choose k elements 
x\, ... ,Xk in this sets, adding the inequalities /r({x, }) > l/n we hnd 
k/n < n(X), whence the upper bound on the cardinality of A„ follows. 

If /x is cr—finite, we choose Xj f X with X, € S’ and /x(A, ) < oo 
and repeat the previous argument with the sets A i n := {x G A lt fl A, : 
r(W) > l/«}, whose union gives A }1 . If not hniteness assumption is 
made, the statement fails: take X = R, S’ = XA (1R) and fi(A) = 0 if 
A = 0 and /x(A) = +oo otherwise. 

Exercise 1.6. Let // be diffuse. First we prove that for all r e (0, 1) and 
all A e S’ there exists a subset B e S’ with 0 < /x(B) < r/x(A). Indeed, 
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if this property fails for some r and A, for all subsets B either n(B) = 0 
or /x(B) > r/x(A). Now, choose B[ C A with /x(5i) e (0, /x(A)) (this is 
possible by assumption), then B 2 C A \ B[ with /x(B 2 ) £ (0- /x(B i)) and 
so on. Since all these sets are contained in A, we have /x(B,) > r/i (A), 
and this contradicts the fact that they are disjoint. 

Now, given t e (0, ii(X)) we define a sequence of pairwise disjoint 
sets Bi and numbers as follows: first set 

si := sup {/z(B) : /x(B) < t } 

and then choose B\ with t > /x(Bi) > sj/2; then recursively set 

S n+ i := sup {/x(B) : B c /x(B) < t - /x(B„)} 

and choose B n+ \ c with t — /x(B n ) > /x(S„ + 1 ) > s„ + i/2. We now 
claim that /x(U, B,) = t. If this property fails, then //( B ,) < t and the 
convergence of the series implies that s,- —»■ 0. On the other hand 


Si > sup I /x(5) : B c X \ U B,, /x(B) <f-^/r(B,) 


The previous property with A = X \U,B; and r = (t — JT /x(B,))//x(A) 
shows that the supremum in the right hand side (independent of z) is 
positive, contradicting the fact that s, 0. 

Exercise 1.7. Let X be a separable metric space and let S’ = S3 (X). If 
/x({x}) > 0 for some x e X, obviously /x is not diffuse. Conversely, 
if A e SS(X) is given, with /x(A) > 0 and fx(B) e {0, /z(A)} for all 
B c A, we can fix a countable dense set (x ( ) C X and define 

r 0 := sup {r > 0 : /x(A 0 B r (x 0 )) = 0} . 

Since r i->- fi(ADB r (x 0 )) is right continuous, the maximality of r 0 easily 
implies that /i(A fl B ro (x 0 )) > 0, and therefore [i(A fl B ro (x 0 )) = /x(A). 
Now we iterate this construction, setting Aj := A D B, 0 (x o), defining 

r! := sup {r > 0 : /x(Ai 0 B r (xi)) = 0} , 

so that /x(Ai fl B ri (x i)) = /x(A[) = /x(A). Continuing in this way, we 
have a nonincreasing family of sets (A,) with /x(A,) = /z(A); it follows 
that /x(n, d,) = ii(A) > 0. On the other hand, any point x e P| ( A,- 
satisfies 


c/(.\', Xj) = r i 


Vi e N. 
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By the density of the family (x/), this intersection contains at most one 
point (and at least one, because the measure is positive). It follows that 
this point is an atom of /i. 

Exercise 1.8. Cantor’s middle third set can be obtained as follows: let 
C 0 = [0, 1], let Ci the set obtained from Co by removing the interval 
(1/3, 2/3), let C 2 be the set obtained from Ci by removing the intervals 
(1/9, 2/9) and (7/9, 8/9), and so on. Each set C„ consists of 2" disjoint 
closed intervals with length 3~ n , so that A(C„) = (2/3)" —> 0. If follows 
that the intersection C of all sets C„ is a closed and ^-negligible set. 

In order to show that C has the cardinality of continuum (at this stage 
it is not even obvious that C 7 ^ 0 !) we recall that numbers x e [0, 1] 
can be represented with a ternary, instead of a decimal, expansion: this 
means that we can write 

x = ^^< 2 ;3 - ' = 0, aifl 2 «3 • • • 

(>1 


with the ternary digits a, e {0, 1, 2}. As for decimal expansions, this 
representation is not unique; for instance 1 /3 can be written either as 0.1 
or as 0.0222..., and 2/3 can be written either as 0.2 or as 0.1222.... 
It is easy to check that Ci corresponds to the set of numbers that can 
be expressed by a ternary representation not having 1 as first digit, C 2 
corresponds to the set of numbers that admit a representation not having 
1 as a first or second digit, and so on. It follows that C is the set of 
numbers that admit a ternary representation not using the digit 1 : since 
the map 

OO 

(ci\, <32, . . .) e {0, 2} N ’ I —y X = y^fl;3~ f 

i=l 

provides a bijection of {0, 2}’ ! with C, and the cardinality of {0, 2\' : is 
the continuum, this proves that C has the cardinality of continuum. 

Exercise 1.9. Let {q„}neN be an enumeration of the rational numbers in 
[ 0 , 1 ], and set 

OO 

A [_J(d« ~ “2 ", q„ + -2 "). 

n =0 

Then A c M is open and k(A) < s2~ n ~ l = s (why is the inequality 
strict ?). Therefore [0, 1] \ A has Lebesgue measure strictly less than s 
and an empty interior, because [0, 1] \ A does not intersect Q. 

Exercise 1.11 Let {/„}„<=n be an enumeration of the open intervals with 
rational endpoints of (0, 1). By the construction in Exercise (1.9), for any 
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interval I and any 8 e (0, /-(f)) wc can find a compact set C C / with 
an empty interior such that 0 < A(C) < 8. We will define 

00 

E: =U C ‘ 

/=o 

where C n C /„ are compact sets with an empty interior, a(C„) > 0 and 
A(C„) < 8 n . The choice of C n and 8 n will be done recursively. Notice 
first that 

X(E n I n ) > HC n ) >0 Vne N, 

so we have only to take care of the condition X(E fl /„) < a(/„). Set 
p n = \ UqC,) and notice that > 0 because all C, have an empty 

interior. Since 

no o oo 

A(/„ fl £) < A(/„ n |^J C;) + 5/ = k(/„) — p„ + 5/ 

0 i = n +1 1=71+1 

it suffices to choose 8„ (and C„) in such a way that 8j < fi,,. This 
is possible, choosing for instance 8 n+ \ > 0 satisfying 

111 1 1 

8 n+l < max -p n , -p n - 1 , ..., ^Po , 


to get 8, < 2" 1 p n for i > n. 

Exercise 1,12. Let A be /z-measurablc and let B, C G be satisfying 
AAB c C and /r.(C) = 0. For any set D c X we have, by monotonicity 
of /x*, 

/i*(D n A) + /x*(D \ A) < /i*{D fi (B U C)) + /x*((D \ S) U C). 

Since g*(DnC) < /x*(C) = /x(C) = 0, by using twice the subadditivity 
of ix* and then the additivity of B we get 

/x*(D n A) + /x*(D \ A) < /x*(D DB) + /x*(D \ B) = /x*(D ). 

Since D is arbitrary, this proves that A is additive. 

Exercise 1.13. The statement is trivial if /x*(A ) = oo. If not, for any 
n e N* we can find, by the definition of /x*, a countable union A„ of 
sets of such that A n D A and /x(A„) < /x*(A) + I /n. Then, setting 
B := P) n A„ we have B D A and /x(B) < inf,, /x*(A) + 1/n = /x*(A). 
The inequality /x( B ) > n*(B) follows by the monotonicity of /x*, taking 
into account that /x*(B ) = /x(B). 
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Exercise 1.14. & is a a -algebra: stability under complement is immedi¬ 
ate, because A C AB C = AAB\ if A,- AB, C C,, then (1J ( A,)A(1J ( 6,) c 
U ( C,, and since /x-negligible sets are stable under countable unions, this 
proves that S ^ is stable under countable unions. 

The extension /x(A) := /x(5), where B e S is any set such that AAB 
is contained in a /x-negligible set of S, is well defined and er-additive on 
S' fi : if AAB c C and AAB' c C', then BAB' cCU C'\ consequently, 
if /x(C) = /x(C') = 0 it must be /i(B) = n(B'). The a -additivity can be 
proven with an argument analogous to the one used to show that S is a 
a-algebra. 

/x-negligible sets of S /A are characterized by the property of being con¬ 
tained in a /x-negligible set of S\ if A e S /t is /x-negligible, there exist 
/x-negligible sets B, C € S with AAB C C; as a consequence A is 
contained in the /x-negligible set B U C e S . Conversely, if A c X is 
contained in a /x-negligible set C e S we may take B = 0 to conclude 
that A e S^ and /x(A) = 0. 

Exercise 1.15. Let A be additive; by Exercise 1.13 we can find a set 
B e S containing A with jiiB) = /x*(A). The additivity of A and the 
equality /x*(B) = /x(B) give 

/x(fi) = /x*(A) + /x*(5 \ A). 

As a consequence /x*(5 \ A) = 0. Now we apply Exercise 1.13 again, to 
find a /x-negligible set C e S containing A\B. It follows that A AS is 
contained in C, and therefore A is /x-measurable. 

Exercise 1.16. Let us first build a family of pairwise disjoints sets {A,-},•<=/ 
C S? (N), with / and all sets A, having an infinite cardinality and A,- = 
N (the construction of the o -algebra will be more clear if we keep I and 
N distinct). The family {A,} can be obtained, for instance, through a 
bijective correspondence S between N x N and N, setting A, := S({/} x 
N). Then, we define jt : N —► I by 

7 r(n) = i, where i € I is the unique index such that n e A; 
and (with the convention tt _1 (0) = 0 ) 


& := {7r _1 (/) : / C /} . 


It is immediate to check that & is a a-algebra, that A,- = 7 r “ 1 ({*’}) G S 
and that any nonempty set in & contains one of the sets A,. Therefore 
& contains infinitely many sets, and all of them except 0 have an infinite 
cardinality. 
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Exercise 1.17. It suffices to define fi(A) = 0 if A has a finite cardinality, 
and +oo otherwise. A finite union of sets has an infinite cardinality if and 
only if at least one of the sets has an infinite cardinality, and this shows 
that /I is additive. 

The solutions of the next exercises require a more advanced knowledge 
of set theory, and in particular the theory of ordinals, the transfinite in¬ 
duction, the behavior of cardinality under unions and products, and Zorn 
lemma. We shall denote by co the smallest uncountable ordinal and by x 
the cardinality of continuum. 

Exercise 1.18. Notice that c cr(J8) implies 

|0 a *, rc : (aa) c ^ o) ’ b e ^ u) j c 

Therefore, if i is the successor of j, we obtain c a(X8)\ analog¬ 
ously, if i has no predecessor, and c cr(Jf) for all j e i, then 
U/ei namely is contained in cr(J8). Using these two facts, 
one obtains by transfinite induction that ^ (l) c rr(.yU) for all / e co. An 
analogous induction argument shows that C ^ (/) whenever i e j. 

So, the union °// := U /e = ft) is contained in a (%f) and, to prove 
that equality holds, it suffices to show that this union is a a -algebra. Let 
( B k ) c 1Z and let 4 G co be such that B k e .X'U Since 4 are countable 
and co is uncountable we have i := [_J 4 4 £ ® and all sets B k belong to 
■X (,) . It follows that their union belongs to A where j is the successor 
of i, and therefore to . An analogous (and simpler) argument proves 
that X/ is stable under complement. 

Exercise 1.19. Obviously 88 (K) has at least the cardinality of continuum, 
so we need only to show an upper bound on the cardinality of 88 (M). The 
proof is based on the fact that a union X-, and a product X i€j X t 
have cardinality not greater than x if the index set J and all sets A,- have 
cardinality not greater than y. Let .X (l> be defined as in Exercise 1.19, 
with J8 having at most the cardinality of continuum. Using the previous 
property of products, with J even countable, one can prove by transfinite 
induction that, for all i G co, ) has at most cardinality y. If we choose 
as -Xf the class of intervals, whose cardinality is (at most) /, we find 

88(E) = a(J8) = 

i€co 

Now we use the above mentioned property of unions, with J = co and 
Xj = ■X <1) , to conclude that 88(E) has at most the cardinality of con¬ 
tinuum. 
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Exercise 1.20. Obviously 2z? has a cardinality not greater than the car¬ 
dinality of ^(M); by Bernstein theorem (1 ) it suffices to show that the 
cardinality of & (M) is not greater than the cardinality of Jzf: if C is the 
Cantor set of Exercise 1.8, we know that fX (K) is in one-to-one corres¬ 
pondence of & (C), because C has the cardinality of continuum; on the 
other hand, any subset of C obviously belongs to Jz?, because C has null 
Lebesgue measure. 

Exercise 1.21. Let S c & ( X ) be a cr-algebra. Assume by contradiction 
that S is infinite and countable. We define the equivalence relation 

y ~ y' if and only if ((y e B <£> y' e B) VB e S) 

and let & be the partition of X in equivalence classes. We now prove that 
FP C S. Indeed, let F e &, fix / e F, for any x £ F we have / x 
so there must be B e S such that / e B and x £ B (or the opposite, but 
then we may consider B c ); given this set B, for any g € F we have that 
g ~ / implies g e B, so that F c B. Since x is arbitrary we conclude 
that 


f = n 


Be£,FcB 


Now, since S is countable, it follows that F e S. We eventually note 
that any set in S is union of sets in but then, if were finite then S 
would be finite, whereas if 'X were infinite then S would be uncountable. 

Exercise 1.22 We define as in the solution of the previous exercise, in 
this case it has finite cardinality, say n ; consequently, there are 2" sets in 
S. 

Exercise 1.23 We define ■'X as in the solution of Exercise 1.21; we also 
adapt the above argument to show again that ■'X C S. Indeed, let fe,?, 
fix / e F, for any x F we have f ^ x so there must be B = Bp x e S 
such that / e B and x £ B\ and again F C Bp, x - Hence 



xeX,x£F 


and this proves that C sF, since X is countable. We then use the 
Axiom of Choice to define a function 4> : & —X such that (p(F) e F, 
and eventually define /x = 


* 1 * If A has cardinality not greater than B, and B has cardinality not greater than A, then there exists 
a bijection between A and B 
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Exercise 1.24. We begin our construction with an algebra r 0 in P/ (N) 
and /z 0 : t o —► {0, 1} which is additive but not er-additive. For instance 
we may take as r 0 the algebra generated by singletons {*} with x e N 
(i.e. the sets A C N such that either A or A c are finite) and set 

[ 0 if A is finite; 

/z 0 (A) := < 

|l if A c is finite. 

We will extend /z 0 to an additive function, that we still denote by /z 0 , 
defined on the whole of ^(N). If such an extension exists, it can’t be 
or-additive, because /zo({n}) = 0 for all n e N, while /xo(N) = 1. 

In the class ^ of pairs (t, /z) with r algebra and /z : t —> {0, 1} 
additive, we define the partial order relation (r, /z) < (r', /z'j by r C 
r' and /zj T = /z; then we consider the class %\ ] of all (r, /z) satisfying 
(r, /z) > (to, /zo). By Zorn lemma, we can find a maximal (f, /z) in this 
class: indeed, it is easy to check that any totally ordered chain I C 
has an upper bound (r', /z'), defined by 

r' := r and /z'(A) := /z(A) where Aei, (r, /z) € I. 

(r,/i)s/ 


We will show that the maximality of (f, /z) forces f to coincide with 
& (N), so that \x will be the desired extension of /zo- 
Let us assume by contradiction that f C & (N) and choose Z C N 
with Z ^ f. We notice that 

{(AinZ)U(A 2 nZ c ) : Ai, A 2 ef} 

is the algebra generated by f U {Z}. Moreover, either Z or Z c satisfy the 
following property 

for all A e x with /z(A) = 1, Z n A ^ 0. (A. 13) 

If not, we would be able to find A\, A 2 e f with Aj fl Z = A 2 D Z c = 0 
and fi(Ai) = /z(A 2 ) = 1, so that A\ and A 2 would be disjoint and 
/z(Ai U A 2 ) = 2, contradicting the fact that /z maps f into {0, 1}. Possibly 
replacing Z by its complement we shall assume that Z fulfils (A. 13). 
Now we extend /z to the algebra generated by f U {Z}, as follows: 

/z(B) := jiz(Ai) whenever A\, A 2 e f and B = (Ai D Z) U (A 2 fl Z f ). 

" (A. 14) 

Let us check that /z is well dehned and additive. 
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1. /x is well defined: if 

b = {A\ n z) u (a 2 n z c ) = (A 3 n z) u (a 4 n z c ) 

then (Ai fi Z) = (A 3 fl Z), and if /x(Ai) ^ /x(A 3 ) then one of the 
two numbers, say /x(Ai), equals 1, while /x(A 3 ) = 0. Defining A := 
Ai \ A 3 we have /x(A) = 1 and A fl Z = 0, contradicting (A. 13). 

2. Suppose B, B' e f are disjoint. Let 5 = (Ai fi Z) U (A 2 D Z f ) 
and B' = (Aj n Z) U (Aj fl Z c ). Then A\ fl Aj fl Z = 0. Setting 
A" := Aj \ A\ we still have B' = (Aj' flZ)U (Aj fi Z c ), and then we 
can use the additivity of /x to conclude that 

jl(B U B') = fi(Ai U Aj) = £(A0 + /x(Aj) = £(fi) + £(£')• 

If 5 e r we can choose Ai = A 2 = fi in (A.14) to obtain that jl(B) = 
jx{B), so that /x extends jl to the algebra generated by f U {Z}. This 
violates the maximality of (r, /x). 

Exercise 1.25 We obviously need only to show that the cardinality of 
C is at least equal to the continuum. By the inner regularity of A we 
can assume with no loss of generality that C is closed. Now, we define 
A = (0, 1) \ C and 

g(t) := A([0, r]nc) re [0,1]. 

This continuous function maps continuously [0, 1] onto [0, A(C) |, and 
it is constant in any connected component of A, so that g(A) is at most 
countable. Since g(C) contains [0, A(C)] \ g(A) we obtain that C has 
cardinality at least equal to the continuum (one can actually see that 
g(C) = g([ 0, 1])). 

Exercise 1.26 Since K is totally bounded, for all e > 0 there exist fi¬ 
nitely many balls B\, ..., B N with radius e whose union covers K. The 
properties of /x imply the existence of an index i such that /x({« : x n e 
Bi}) = 1. Now we start with e = 1 and find a closed ball B (1) with 
radius 1 such that /x({« : x n € B n> }) = 1. Repeating this construction 
in B n> we find a closed ball B <2) with radius 1/2 contained in B n> with 
/x({« : x n e B a> }) = 1. Continuing in this way, if z is the common point 
of the balls B we find x„ /x-converges to z. 


Chapter 2 

Exercise 2.1 The verification is straightforward and is omitted. 
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Exercise 2.2 Let ip, i/s : X —> R be ^'-measurable. If <p(x) + i//(x) < t 
we can find a rational number r such that <p(x) < r and r//(x) < t — r, 
hence 

{<p + r/r < tj = [{ip < r] n {ip- < t - r}]. 

reQ 

This proves that <p + r/r is A-measurable. Analogously, since 

{<p 2 > a} = {<p > U {<p < — Va}, a > 0 

we obtain that (p 2 is measurable. Considering the difference ((p + i/0 2 — 
(<p — i/0 2 we obtain that <py'/ is A--measurable. 

Exercise 2.3. (i) The verification of the axioms of distance is immediate. 
In order to prove the compactness of R, let us consider a sequence (x n ) c 
R. If sup (J x„ = +oo we can find for any k an index n(k) such that 
x n (k) > k\ it follows that d(x„ k , +oo) = | arctan x n(k) — 7t/ 2| tends to 
0, so that x n( k) —>■ +oc in the metric space. Analogously, if inf, x n = 
—oo we can find a subsequence converging to —oo in (R, d). Finally, if 
both sup H x n and inf„ x n are finite, the sequence (x n ) is bounded and we 
can extract, thanks to the Bolzano-Weierstrass theorem, a subsequence 
x n (k) converging to x el The continuity of z arctan a implies that 
x„(k) x in (R, d). To prove the equivalence of the two topologies, 
let us work with closed sets: if C c R is closed with respect to the 
(R, d) topology, then it is closed with respect to the Euclidean topology, 
because \x n — x\ —> 0 implies | arctan x,, — arctan a | —»• 0. On the other 
hand, if | arctan x n — arctan x \ —> 0 then for n large enough arctan x n 
belongs to an interval I := (arctanx — s, arctanx + s) C (—tt/2, tt/ 2); 
the continuity of y ^ tan y in I implies that x n —> x. This proves the 
converse implication, and the equivalence of the two topologies. 

(ii) We notice first that, according to (i), 88 (R) and {—oo}, {+oo} belong 
to 88 (R). Therefore, if / is measurable between A and the Borel a- 
algebra of (R, d), then it is A-mcasurablc according to (2.2). According 
to the measurability criterion, in order to prove the converse implication 
it suffices to show that 88 ( R) is generated by ^(R) U {—oo} U {+oo}: 
this follows by the fact that if C C R is closed, then 

c = (C n R) u (c n {-oo}) u (c n {+oo}) 

(again by (i)) belongs to the algebra generated by 88 (R)U{—oo}U{+oo}, 
therefore the a-algebra generated by this family of sets contains 88 (R). 
Exercise 2.4. If {/ ^ g} is contained in a /x-negligible set C of A, 
for some A-measurable function g, then {/ > /} A{g > t) C C for all 
t e R, and since jg > (} e A it follows that {/ > t) e A^; this means 
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that / is (^-measurable. Conversely, assume that / is S' ■-measurable 
and find for all q e Q a set B q e <§ and a /x-negligible set C q e $ with 
{/ > q}AB q c C q . We define 

g(x) := sup [q e Q : x e B q ), C := [J C 9 . 

?eQ 

Since {g < f} = f] q<l B q we have that g is ^-measurable. Let us prove 
that fix) = g(x) for all x £ C : for any such x we have x e B q for 
all q < /(x), therefore g(x) > /(x); if the inequality were strict, there 
would exist q € Q with x e B q and q > /(x), therefore x would be in 
B q \{f >q}GC q C C. 

Exercise 2.5. If a < r we can find a nondecreasing family of partitions 
<j\,... ,cr n with <T| = a, <t„ = r and a i+l \ cr, containing just one point. 
Therefore, in the proof of the monotonicity of cr i—^ I„(f) we need only 
to show that I a (f) < I a u (?}(/) whenever t e (0, oo) \ cr. Let ct = 
{to, .... t!\j } and let i be the last index such that t, < t. If i < N we use 
the inequality 

(t i+ i - ti)f{t i+ 1) = (t i+ 1 - 0/(h+i) + (l - h)/(b+i) 

< (U +i - 0/fe+i) + (l - h)/(0 

adding to both sides £//;(0+i-0)/(0'+i) we obtain I a (/) < / CT u{ ? }(/)- 
If / = the argument is even easier, because the difference W)(/) - 
4 t(/) is given by (? - t N )f{t). 

Now, let /, g : (0, +oo) —>• [0, +oo) be given; since I a (f + g) = 
h(f) + Ia(g) we get 4(/ + g) < f(t) dt + g(t)dt. Since 
a e E is arbitrary, this proves that 


f 


fit) + g(t)dt < 


poo pc 

/ / {t)dt + / 

JO Jo 


git) dt. 


< 


f 

Jo 


In order to prove the converse inequality, fix L < f™ fit) dt, M 
fo° Sit ) dt and find cr, rj e E with I a (f) > L and /, 7 (g) > M; then 

fit) + git)dt > IaUrjif + g) = laUriif) + t<rUrjig) 

>IAf) + fig)>L + M. 

Letting L f / 0 °° /(f) dt and M \ / 0 °° g(?) dt the inequality is proved. 
Exercise 2.6. We will prove that /* is lower semicontinuous, the proof 
of the upper semicontinuity of f* being analogous. Let (x„) c 1 be 
converging to x and use the definition of /*(x„) to find y„ e 1R such that 


1 

y„ I < - 

n 


1 

and fiy n )<f*ix„) + ~. 

n 


158 Luigi Ambrosio, Giuseppe Da Prato and Andrea Mennucci 


Then (y n ) still converges to a:, so that 

1 

f*(x) < hm mf /(>'„) < liminf /*(x„) + - = liminf /*(x„). 

n—>oo n —>oo fj n—>oo 

Exercise 2.7. Let r e M and let (x n ) c {/* < f} be convergent to x. 
Then, the lower semicontinuity of /* gives 

/*(x) < liminf f*.{x n ) < t. 

n—>oo 

This proves that x € {/*<?}, so that {/* < t } is closed. The proof for f* 
is similar. Since the Borel a -algebra is generated by halflines, it follows 
that f* and /* are Borel, and the same is true for the set {/* = /*}, that 
coincides with E. 

Exercise 2.8. Set <p Q := <p, A 0 := {<p 0 > «o} and cp i := <p — aolx 0 — 
0. Then, set A\ := {^i > ai} and cp 2 := <p i — a\l A{ and so on. If 
ijo(x) = +oo then <p„(x) = +oo for all n, so that x belongs to all sets 
Aj and X7=o a <L ; (x) = +oo. We then assume that <p(x) < +oo in the 
following. By construction we have that 0 < <p l+ \ < (pi < ■ ■ ■ < (fio — (p, 
hence 

n n 

(p = <p n+ 1 + ^(<Pi - <Pi + 1 ) = <Pn +1 + XA'Lr 
i'= 0 i=0 

This proves that <p > a ( 4 A .. If the inequality were strict for some 

x e X with <p(x) < +oo, we could find e > 0 such that <p;(x) > s for 
all i e N, and since a { < e for i large enough, we would get x e A, for i 
large enough. But since the series a,- is not convergent, we would get 
ciit A (x) = oo, a contradiction. 

Exercise 2.9. Assume by contradiction that the absolute continuity prop¬ 
erty fails. Then, for some e > 0 we can find A; with /x(A ; ) < 2~‘ and 
f A \cp \ dpi > s. It follows that the set B := limsup,- A, is /x-negligible, 
and 

B„ := 1J Ai\B i 0. 

i >n 

Since f B \(p\ dpi > f A \(p\ dpi > e we find a contradiction with the dom¬ 
inated convergence theorem applied to the functions t B \(p\, pointwise 
converging to 0. 

Exercise 2.10. Let e > 0 be given and let S > 0 be such that f A \ip\ dpi < 
ell whenever A e S’ and /x(A) < <5. The triangle inequality gives, 
with the same choice of A, f A \(p n \ dpi < e for n > n 0 , provided || ip n — 
(p\\i < e/1 for n > n 0 . Since (pi,, ip no are integrable, we can find 
Si > 0 such that f A \<pj\ d/i < e whenever A e S and pi(A ) < 8,. If 
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<$ 0 = min{<5, min, <$,}, we have f A \q>„ \ d/i < e/2 whenever n e N, A e S’ 
and /x(A) < S. 

A possible example for the second question is £2 = [0, 1], /z = A the 
Lebesgue measure, and <p„ The uniform integrability is a 

direct consequence of the convergence of <p„ to 0 in L 1 . If <p„ < g, then 

OO CO 

= - 8 
n =1 n= 1 

but / ESE <Pn = ET !/« = +°°- 
Exercise 2.11. (a) For any y e X we have 

gd*) < g(y) + Ad(x, y) < g(y) + Ad(x\ y) + Xd(x, x'). 

Since y is arbitrary we get g x (x) < g A (x') + kd(x, x'). Reversing the 
roles of x and x' the inequality is achieved. 

(b) Clearly the family (g A ) is monotone with respect to X, and since we 
can always choose y = x in the minimization problem we have g x (x) < 
g(x). Assume that sup A g,(x) is finite (otherwise the statement is trivial) 
and let x x such that g x (x) + A -1 > g(x x ) + kd(x, x x ). This inequality 
implies that x x —»• x as A —> oo and, now neglecting the term A d{x, x x ), 
that 

1 

gx(x) + - > g(x x ). 

Passing to the limit in this inequality as A —> oo and using the lower 
semicontinuity of g we get sup^ g x (x) > g(x). 

Exercise 2.12. Let us first assume that / is bounded. For e > 0 we 
consider the functions 

1 f x+e 

fe(x,y):=— / /(x , y) dx'. 

2s Jx-s 

Since x m>- /(x, y) is continuous, we can apply the mean value theorem 
to obtain that / £ (x, y) —> /(x, y) as s / 0. So, in order to show that / 
is a Borel function, we need only to show that f s are Borel. 

We will prove indeed that f e are continuous: let x„ —>• x and y„ —> y; 
since /(x', y n ) —> /(x', y) for all x' e M, we have 

1 [Xn _ £iXn+s] (x')f(x', y„) -► %_ £ ^ +£] (x')/(x', y) 

for all x' e M \ {x — e, x + e}. Therefore, since / is bounded, the 
dominated convergence theorem yields 

fe(x,y) = XT f kx-e,x+e](x')f(x',y)dx' 
x-s Jr 

= x~ lim f kxn-e,xn+si(x')f(x',y n )dx'= lim f E (x n ,y n ). 
2S n^oo n-f-o o 
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In the general case when / is not bounded we approximate it by the 
bounded functions f h (x ) := max{—//, min{/(x), h}}, with h e N, that 
are still separately continuous, and therefore Borel. 

Chapter 3 

Exercise 3.1. On the real line, endowed with the Lebesgue measure, 
the function (1 + |x|) _l belongs to L 2 , but not to L 1 , and the function 
|x| _1/,2 l (0 !>(jc) belongs to L l , but not to L 2 . Turning back to the general 
case, if i p e L pt D L P1 with pi < p 2 , from the inequality 

\ip\ p < max{|^| pi , \cp \ p2 } < \<p\ p ' + \(p\ P2 Vp e [pi, pi] 

(that can be verified considering separately the cases \cp\ < I and \ (p\ > 1) 
we get that <p e L p for all p e [pi, p 2 ]. 

Exercise 3.2. The statement is trivial if \\f\\ q = 0, so we assume that 
\\f\\ q > 0. For e > 0 the set X € := {|/| > f} has finite /x-mcasure, by 
the Markov inequality, hence the inclusion between U spaces for finite 
measures gives that |/|i Xt e L P (X, S’, /i). Since the dominated conver¬ 
gence theorem gives 

lim f \f - fl x fdp, = lim [ \f\ q dp. = 0 

ef° Jx e f 0 Jx\x e 

we can choose / = ft x for e > 0 small enough. 

Exercise 3.3. By homogeneity we can assume that \\(p\\ p = 1 and || \j/ || v = 
1. Since 

/•('Mf + hT_ HW f^ = wh + H4_ 1=0 

A V p q ) p q 

and the function among parentheses is nonnegative, it follows that if van¬ 
ishes /x-a.e. In particular, for /i-a.e. x, \<4>(x)\ is a minimizer of 

yl 

y ^ : — \if(x)\y 
q 

in [0, Too). But this problem has a unique minimizer, given by \ifr(x)\ q ~ l , 
and we conclude. 

Exercise 3.4. It suffices to apply Holder’s inequality to the functions \(p\ r 
and \ifr\ T , with the dual exponents p/r and q/r, to obtain 


\Wt\\ r r < IIISPriL/rlll^ril«/r = 1^11^11^11^ 
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Exercise 3.5. The positive part and the negative part of <p — (p„ have the 
same integral, hence 

/ \<p - <p n \ dpi = 2 / (<p - cp n ) + dpi. 

Jx Jx 

The condition lim inf„ tp n > ip ensures that (cp — (p n ) + is pointwise conver¬ 
gent to 0; in addition, since ip n are nonnegative, the functions are dom¬ 
inated by (p + . Therefore the dominated convergence theorem gives the 
result. 

Exercise 3.6. If x// n —> \jj /x-a.e. we apply Fatou’s lemma to the functions 
i//„ + i p n to obtain 


lim inf 

n—>oo 


L 


tyn + <Pn > 


L 


ij/ + cp dfi. 


Therefore 


limsup / xfr,, dpi + liminf / (p n dfi > / <pd/i+ / 

n —^oo Jx n^-oo J x J x J x 


\l/ da. 


Subtracting f i/rdfA from both sides the statement is achieved. In the 
general case, let n(k ) be a subsequence such that I ini/- / (p n (k) d[i = 
liminf n f x <p n , and let n(k(s )) be a further subsequence converging to 
i p /x-a.e. Then 


lim inf 


/ cp„dfi = lim / <Pn(k( S )) d/i > / liminf (p n(k{s)) dpi 

Jx S ^°°J X Jx n 


> 


L 


lim inf <p n d/i. 

n—¥ oo 


Exercise 3.7. We show only how (3.13) implies g(tx + (1 — t)y) < 
tg{x) + (1 — t)g(y) for all x, y e J and t e [0, 1], We prove first, by 
induction on m, that 



for all x\, ..., a' 2 »> e J. The case m = 1 is (3.13) and the induction step 
can be achieved grouping the terms as follows: 


E 



1 

2 



1 

^rn—l ^i 


2 m ~ 1 


+ E 


—^X 2 m-l+i 
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Now, considering the case when x t = x for I < i < k and x,- = y 
otherwise, we get 

k 

g(tx + (1 - t)y) < tg(x) + (1 - t)g(y) with t = —. 

Since g is continuous, by approximation we get g(tx + (1 — t)y) < 
tg(x) + (1 — t)g(y) for all x, y e J and t e [0, 1], 

Exercise 3.8. Let us first show the existence of zo- Let A = g (M) and 
let u„ = g(z n ) with u n l inf A. Since u„ is uniformly bounded from 
above, our assumption on g ensures that (z. n ) is bounded. By the Bolzano- 
Weierstrass theorem we can hnd a subsequence z n (k) convergent to z e 
M. The continuity of g gives that u n(k) = g(z n (k )) converge to g(z). It 
follows that inf A is finite and coincides with g(z). Now, by applying the 
convexity inequality of the previous exercise with x = zi, y — Zo and 
t = (zi- Zo)/(zi ~ zo), we get 

g(z 2 ) - gfei) > g(zi) - g(zo) > 

Z 2 — Zi z\ — Zo 

for zo < Z\ < Z. 2 , proving the monotonicity of g in [z 0 , +oo). The 
argument in (—oo, zo] is analogous. 

Exercise 3.9. Fatou’s lemma gives liminf,, f <p n d/i > f lim inf,, <p n d/i > 
f ipdjx. Therefore t n := f (p n dfi —> t := f cpd/i] we can apply Exer¬ 
cise 3.5 to the functions cp n /t n to obtain that <p„/f„ —> <p/t in L l . From 
this, taking into account that t n —> t, the convergence of <p n to ip in id 
follows. 

Exercise 3.10. LeUh(c) := d>(c)/c and notice that |^, | < cd>(|^,|)/<I>(c) 
= on {\<pi\ > c}. Therefore 

f f Od^-I) f M 

/ \<Pi\dii< / dn+ / \(Pi\dfi < —-+c/r(A). 

JA J An{\<pi\>c) ^ \ c ) JAn{\ Vi \<.c) 

Let us choose c sufficiently large, such that M/ty(c) < e/2, and then 
8 > 0 such that c8 < e/2. The inequality above yields f A \cp, \ dji < e 
whenever /i(A) < 8. 

Exercise 3.11. Let (/„) c Ci,(X) be converging in L 1 to /, and let f n(k) 
be a subsequence pointwise convergent fi-a.e. to /. Then, given any 
e > 0, by Egorov theorem we can hnd a Borel set B c X with /i(B) < e 
and /„(*) —> / uniformly on B‘ . By the inner regularity of the measure 
we can hnd a closed set C C B c such that ii(X \C) < e. The function / 
restricted to C, being the uniform limit of bounded continuous functions, 
is bounded and continuous. 
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Chapter 4 

Exercise 4.1. Notice that (-, •) is obviously symmetric, that (x, —y) = 
— (x, y) = {—x, y) and that (x, x) = ||x|| 2 > 0, with equality only if 
x = 0. Notice that the parallelogram identity gives 

II* + *' + 2v|| 2 + ||* - x'|| 2 = 2 ||jc + v|| 2 + 2\\x’ + y\\ 2 

= 8(x, y) + 8<x', y) - 2\\x - y|| 2 - 2||x' - y|| 2 


and 

II* + *' - 2y|| 2 + ||x - x'|| 2 = 2||x - y|| 2 + 2||x' - y|| 2 

= 8(x, -y) + 8(x', -y) - 2||x + y|| 2 - 2||x' + y|| 2 . 

Subtracting and dividing by 4 we get 

(x + x\ 2y) = 4(x, y) + 4(x\ y) - 2{x, y) - 2(x', y). 

So, we proved that (x + x', 2y) = 2{x, y) + 2(x', y). Using the relation 
(u, 2v) = 4(u/2, v) (due to the definition of (■, •) and the homogeneity 
of || • ||), we get 

x + x' \ 1 1 , 

Setting x — t\V, x’ — tiv, and defining the continuous function = 
(tv, y), we get 

{t[ -\~ tj\ 1 1 

<!>(—) =- 2 0(h) + -2<Hh). 

This means that </> and —cp are convex in M, so that </> is an affine function, 
and since 0(0) = 0 we get 0(f) = f0(0), i.e. (tu , y) = t(u, y). Coming 
back to the identity above, we get (x + x', y) = (x, y) + (x', y). 
Exercise 4.2. Assume that y = tt k (x). For all z G K and t e [0, 1] we 
have y + t(z — y) belongs to K, so that 

\\y + t(z - y)-x || 2 > ||y - z\\ 2 - 

Expanding the squares we get 

t 2 \\z - y|| 2 + 2 t(z - y, y - x) > 0 Vf e [0,1], 

This implies (either dividing by t > 0 and passing to the limit as t | 0, 
or computing the right derivative at t = 0) that (z — y, x — y) < 0. 
Conversely, if for some y e K this condition holds for all z £ K, the 
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argument can be reversed to get \\y + t{z — y) — x\\ > ||_y — x|| for 
all t > 0. Choosing t = 1 we get ||z — x|| > ||y — x||, proving that 

y = Xk(x). 

Exercise 4.3. Let Y k be the vector space spanned by { j), , f k ) and 

let us prove by induction on k > 1 that /,• is orthogonal to fj whenever 
1 <i < j <k. First we observe that if this property holds for some k, 
then Y k is ^-dimensional and coincides with the vector space spanned by 
{iq, ..., v k \ (being contained in it, and with the same dimension). 

The orthogonality of the vectors /, can be obtained just noticing that 



1=1 


So, f k = v k — jry /( I (v k ) is orthogonal to all vectors in Y k _\. It follows 
that {e k , e.j) = 0 for all i < k. 

Exercise 4.4. Let y = x — (x , e k )e k , we know that the series converges 
in H by Bessel’s inequality. In order to show that ^ k (x, e k )e k = 7 r x {x) 
it suffices to prove that y is orthogonal to all vectors in X. But since 
any vector v e X can be represented as a series, it suffices to show that 
( v , e-,) = 0 for all i. The continuity and linearity of the scalar product 
give 


OO 


(y, et) = (x, et) - ^{x, e k )(x, a) = (x, e,-> - (x, e ,-) = 0. 


Exercise 4.5 Since X and its scalar product coincide with L 2 ([0, 1], 
<^([0, 11), fi), where /x is the counting measure in [0, 1], we obtain 
that X is an Flilbert space. Let us prove by contradiction that X is not 
separable. If S = {/„}„> i were a dense subset, it could be possible to 
find a countable set D C [0, 1] such that /„ (x ) = 0 for all n and all 
x e [0, 1] \ D. Since [0, 1] is not countable we can find xo e [0, 1] \ D 
and define go(x) equal to 1 if x = xo and equal to 0 if x ^ xq. We 
claim that go does not belong to the closure of S. If this property fails, 
we can find a sequence (f n (k)) C S convergent to go /x-a.e. in [0, 1]; 
but, convergence /x-a.e. corresponds to pointwise convergence and since 
g 0 (x 0 ) ^ 0, while f„(k)(x o) = 0 for all k, we obtain a contradiction. 
Exercise 4.6. By Parseval identity we know that x m>- ((x,e,)) is a 
linear isometry from H to lj- Asa consequence, taking the parallelogram 
identity into account, the scalar product is preserved. 

Exercise 4.7. We consider the class of orthonormal systems {c, }, e / of H , 
ordered by inclusion. Zorn’s lemma ensures the existence of a maximal 
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system {e,-},<=/. Let V be the subspace spanned by e,-, let 7 be its closure 
(still a subspace) and let us prove that Y = H. Indeed, if Y were a proper 
subspace of H. we would be able to find, thanks to Corollary 4.5, a unit 
vector e orthogonal to all vectors in 7, and in particular to all vectors 
Cj. Adding e to the family {e, }, 6 / the maximality of the family would be 
violated. Now, by the just proved density of V in H, given any x e H 
we can find a sequence of vectors (v„), finite combinations of vectors e,-, 
such that H* — v„ || —> 0. If we denote by J„ c / the set of indexes used 
to build the vectors {iq, ..., v n }, and by H n the vector space spanned by 
{e,} ieJ „, we know by Proposition 4.6 that 

||* - J>, e,-)e,-|| < \\x - v n \\ ->■ 0. 
i£j n 

As a consequence, setting J = U n J n , we have X = £/€/(*>«/>«!■• 

Chapter 5 

Exercise 5.1. The functions sin mx cos lx are odd, therefore their integral 
on (—71,7r) vanishes. To show that sin mx is orthogonal to sin lx when 
/ ^ m, we integrate twice by parts to get 

r 

I sinmxsinlxdx 


The integrals of products cos mx cos lx can be handled analogously. 
Exercise 5.2. Since for N < M we have 

N M M oo 

ii ^2 Xn - X ! Xn ii - XI n^'ii- X n^'ii 

n= 0 n= 0 i=N+\ i=N +1 

we obtain that (X^ x i) i s a Cauchy sequence in E. Therefore the com¬ 
pleteness of E provides the convergence of the series. Passing to the 
limit as N —► oo in the inequality || x, || < |x, || and using the 

continuity of the norm we obtain (5.15). 

Exercise 5.3. We consider only the first system gk = sin kx, the 

proof for the second one being analogous. The fact that ( g k ) is orthonor¬ 
mal can be easily checked noticing that g k are restrictions to (0,7r) of 
odd functions, and using the orthogonality of sin/:* in L 2 (—tt, tt). Ana¬ 
logously, if / e L 2 (0, Tt) let us consider its extension / to (— n, n) as an 


= — / cos mx cos lx dx 


/ 

J —1 

m 2 r 

~pL 


sin mx sin lx dx. 
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odd function and its Fourier series, which obviously contains no cosinus. 
In (0,7r) we have 


N 


N 


I] ^ sinfcx = £]</, gk)glc, 
k=\ k= 1 


where the scalar products are understood in L 2 ( 0, n). Therefore, from 
the convergence of the Fourier series in L 2 (—tt, jt) to /, which implies 
convergence in L 2 (0, jt) to /, the completeness follows. 

Exercise 5.4. Clearly (e k , e k ) = 1, while 



e ikx e~ ilx dx = 


1 


i (k — I) 


i(k—l)x 


dx 


= 0 


whenever k ^ /. 


As a consequence ( e k ) is an orthonormal system. 

Since the Fourier series .S' v / = ef)ek of / depends linearly on 

/, in order to show completeness we need only to show SV/ —> / when 
/ is real-valued and when / is imaginary-valued (i.e. if is real-valued). 
We consider only the first case, the second one being analogous. Setting 
Ck - (f,e k ), we have 

i r 

c k = _ / f(x) cos kx — if(x ) sinkx dx. 

V 2,7T Jjt 


As a consequence, for k > 1 we have -J 2 jjic k = a k — ib k , where a k and 
b k are the coefficients of the real Fourier series of /, and for k < —1 we 
have y/2/jtCk = a- k + ib- k . For k = 0, instead, we have ^/2/jtco = ao- 
Taking into account these relations and setting bo — 0, we have 


J2 Ck 



(cos kx + i sin kx)(a k — ib k ) 


ilS 

+ ^ (cos kx + i sin kx)(a- k — ib - k ) | 
k=-N > 

+ Rel ^^(cos kx + i sin kx){a k — ib k ) ) 


ao 

~2 

a 0 


N 


+ a k cos kx + b k sin kx, 


k= 1 


and the convergence of S^f to / follows by the convergence in the real¬ 
valued case. 
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Exercise 5.5. It suffices to note that 

2tt (/ fW e ~ lkX = ((/> e k)) 2 , 

where (e k ) is the orthonormal system of Exercise 5.4 and to use its com¬ 
pleteness. 

Exercise 5.6. From the identity Yif=o e ‘ kz = (e lilN+V)z — I )/(<?'" — 1), 
we get 


N 


Y e ik * = e- iNz Y el 


2^ J(2N+\)z _ 1 

kz _ „-iNz e 1 


= e 


k=-N 


k =0 


e iz - 1 


e i(N+\/2)z _ e -i(N+l/2)z sin((A ^ + 1//2 ) Z ) 


e 17 -!- — e~ iz l- 

and we call this term Gn(z ). Hence 

N 


sin(z/2) 


(A. 15) 


s " m = ( E ^ d y)‘‘ 

N l f 71 

J2 -r / f(yy k(x ~ y)d y 

l- —— A! ^ J-7T 


k=—N 

= 'Lt j 7(j)G;v(x - y)dy. 

Using the fact that sin((A + l/2)z)/ sin(z/2) has, still because of (A. 15), 
mean value 1 on (—it, tt), we get 

f(x) - S N f(x ) = ^ J (/CO - f(y))G N (x - y ) dy. 


Exercise 5.7. We apply the Parseval identity to the function fix) = 
x 2 , whose Fouries series contains no sinus. It is simple to check, by 
integration by parts, that a 0 = 27i 2 /3 and that a k = 4k 1 cos kx for 
k > 1. We have then 


1 

7T 




T + E' 


k=l 




k= 1 


16 

¥' 


Rearranging terms, we get k 4 — 7T 4 /90. 
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Exercise 5.8. The polynomials P n are given by Qn/WQnWi, where Q n 
are recursively defined by Q 0 = I and 

n— 1 


k=0 

(a) Since Q 0 = 1, Po = \/y/2 and Q { = x — (x, Pq) Po = x, because 
(x, Pq) = 0. As a consequence P\{x) = y/3/2x. Since (x 2 , Pi) = 0, we 
have also 


P k )P k (x) Vn > 1. 


n —1 




Q 2 (x) = X 2 - <x 2 , Po)Po - <x 2 , Pi)Pi = x 2 - - 

and this leads, with simple calculations, to P 2 (x) = V45/8(x 2 — 1/3). 

(b) Let H be the closure of the vector space spanned by C n . This space 
contains all monomials x n , and therefore all polynomials. Since the poly¬ 
nomials are dense in C(\a, b\), for the sup norm, they are also dense in 
L 2 (a, b). It follows that H = L 2 {a , b). By Proposition 4.13 we conclude 
that (C„) is complete. 

(c) Set 


Zn • — 


2 n + 1 1 

2 iFriV 


Pn(x ) := Zn-J—(x 2 - 1 )" 

d"x 


Clearly the polynomial P n has degree n. So, in order to show that P n = 
P n , we have to show that P„ is orthogonal to all monomials x k ,k — 
0,— 1, and that || P„ || 2 = 1. Since P„ has zeros at ±1 with multipli¬ 
city n, all its derivatives at ±1 with order less than n are zero. Therefore, 
for k <n we have 


{P n ,X k ) = Z n 


jn-l 

x*±— ( x 2 -i r 
d n ~ l x 


71-1 


1 f 1 

— k I x k ~ l - 7 —{x 2 — \) n dx 

_i J-i An - X - 


d 

k-l a 


= (-1 fk\Zn 


r jn—k 


d n ~ k x 


(x 2 - 1)" 


= 0. 


In order to prove that || P n || 2 = 1, still integrating by parts we have 

>1 jn-\ d n+i 


( Pn,Pn) = 


(x 2 -1 r 


2 f 

1,1 J-l d"~h 

of' O d 2n , 

= zl 3_i {l - x)n dZ {x irdx - 


(x 2 - 1)" dx = 


(A. 16) 
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On the other hand 

(1 — x 2 ) n dx = 2n I (1 — x 2 ) n ~ l x 2 dx 


/: 


/: 




= -2 n I (1 - x 2 ) n dx + 2n I (1 - x 2 ) n ~ 1 dx. 


/:< 


so that 


mL 


2 n 
2 n + 

(2 n)\\ 
(2 n + 1)! 


x 2 )" 1 dx = 


9 n 2(2/?)!! 

(1 — x 2 ) 0 dx = -. 

(2/i + l)!! 


Taking into account that 

d 2n 


(x 2 - 1)" = (2n)\ = (2n)!!(2n - 1)!! = 2"n!(2n - 1)!! 


d 2n x 

from (A. 16) we get 


- ~ 2n + 1 1 2(2/?)!! „ 

(P„, P„) =----——— 2 n\{2n - 1)!! = 1. 

' 2 2 2n (n!) 2 (2n + 1)!! 


Exercise 5.9. Recall that 

1 f 11 

Ck = — / f(x)e~ lkx dx. 

2n J-n 

Integrating by parts once and using that /(—7r) = f(n) we get 



Continuing in this way, in m steps we get 


Ck 


1 1 

(ik) m 2?x 



f (m \x)e~ ikx dx. 


Chapter 6 

Exercise 6.1. Let us prove the inclusion 

(J£"i x & 2 ) x ^3 C x (,^2 x ^3), 
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the proof of the converse one being analogous. We have to show that all 
products A x B, with A e &\ x ^ 2 and B e 3P 3 belong to &\ x 
(AP 2 x ^ 3 ). Keeping B fixed, the class of sets A for which this property 
holds is a a -algebra that contains the 7 T-system of measurable rectangles 
A = A] x A 2 (because Ax B = Aj x (A 2 x B) and A 2 x B e AF 2 x ^ 3 ), 
and therefore the whole product cr-algebra .>T 1 x J^ 2 . 

For all A in the product ct- algebra we have 


(Mi x M 2 ) x h 3 (A) = 


/ 

JXtxX 

/ / 

JXi ./x 


M3(4v,x 2 )^1 X M2(X|, V 2 ) 

ft3(Av 1 j C2 ) dll 2 (x 2 ) dudxi) 



M2 x M 3 (A Al ) </mi(*i) = Mix (M2 X M 3 )(A). 


X n 

! <^(M), and thanks to 

Lemma 6.9 the same is true for the open sets. It follows that ^(K") is 
contained in X { 38 (M). Let us consider the class 

l:={ficR : fixRx-xKe 38 (R n )}. 

This class contains the open sets (because the product of open sets is 
open) and it is a a -algebra, so it contains 38 (R). We have thus proved 
that all rectangles B 1 x M x • • • x M, with B\ Borel belong to 38 (K' ! ). By 
a similar argument we can show that all rectangles 

Rx'-xRxBjxRx'-'xR 

are Borel. Intersecting rectangles in these families we obtain that all 
rectangles with Borel sides belong to 38 (W 1 ) and we conclude. 

Exercise 6.3. Assume that A, B e .if,; then there exist Borel sets 
A 1 , B' and Borel Lebesgue negligible sets N A , N B with AAA ' c Na 
and BAB' C N B . Since A' x B' e 38 (R 2 ), by the previous exercise, 

(A x B)A(A' x B') c ( N a xl)U(lx N b ) 

and N a x M and lxiV s are 3£ 2 negligible, we obtain that Ax B e .5? 2 . 
This proves that Jz ? 2 contains the generators of .if, x .if,, and therefore 
the whole cr-algebra. In order to show the strict inclusion, we consider 
the set E = F x {0}, where F c M is not Lebesgue measurable. Since 
E is .^-negligible we have E e 3£ 2 - O ' 1 the other hand, since the 0 
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section E° coincides with F, and therefore it does not belong to the 
set E can’t belong to the product of the two a-algebras. 

Exercise 6.4. Let si be the a -algebra generated by these sets; since these 
sets are obviously cylindrical, si is contained in the product a-algebra. 

v. n 

The class of sets Be X x X, such that B x X n+ \ x X n+ 2 x • • • e si 
is a a -algebra containing the measurable rectangles A\ x ■ ■ ■ x A n , and 

v, n 

therefore contains the product a -algebra X , , 5 ^. Therefore si contains 
the cylindrical sets and, by definition, the whole product a -algebra. 
Exercise 6.5. The sections T y := {(v, z) : (x,y,z) e T) are squares 
with length side 2-Jr 2 — \y\ 2 for 0 < |y| < r, hence 

2?\T) = j 1 J? 2 (T y )dy = 8j\r 2 -y 2 )dy = 8(r 3 - K 3 ) = ^r 3 . 


Exercise 6.6. For x e M" (with n > 3) let 

r := (x 2 + x?) 1/2 , A r := {(x 3 , ..., x n ) : (x 3 2 H-h x 2 ) < 1 - r 2 } . 

Then, using polar coordinates we get 


/ 

J ir 


a>„ = I S£ n {A r ) dx\dx2 — 2n(s n - 
'{«■<!} 

2tt 

= - (On- 2 - 


« f 

Jo 


r(l — r 2 ) ( ” 2)//2 dr 


Therefore 


2 k ~hT k ~ l 7t k 

M2k = 2k{2k — 2) • • • 4 M2 = ~k\ 


and an analogous argument gives u> 2 k+\ = 2 k+] u k /(2k + 1)!!. 


Exercise 6.7. In order to show that co n = 


T n/2 


we show that the right 


r (2 + b 

hand side satisfies the same recursion formula of the previous exercise. 
Since (thanks to the identities T(l) = 1, T( 1/2) = y/n) the formula 
holds when n = 1, 2, this will prove that the identity holds for all n. For 
n > 2 we have 


ir n ' 2 _ jz • Tt^-V' 2 _ 2n it( n ~ 2 d 2 

r(f+ D _ fryf) _ ~n r(^ + i)‘ 

Exercise 6.8. We know, by Exercise 2.4, that there exist a A.-negligible 
set fV e x and a ■£ x (^-measurable function F : X x Y —>• [0, +oo] 
such that {F £ F} is contained in N. By applying the Fubini-Tonelli 
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theorem to we obtain that N x is v-negligible in Y for /x-a.e. x e X. 
Since {F{x, •) ^ F{x, •)} C N x , still Exercise 2.4 gives that F(x, •) is 
v-measurable for /x-a.e. x € X. This proves statement (i). Since, still for 
/x-a.e. x e X, the integral on Y (with respect to v) of F (x , •) coincides 
with the integral of F(x, •), statements (ii) and (iii) follow by applying 
the Fubini-Tonelli theorem to F. 

Exercise 6.9. Indeed, fi(D y ) = /x({y}) = 0 for all y e Y, so that 
f Y /x(Dy) dv(y) = 0. On the other hand, v(D x ) = v({.v}) = 1 for all 
x € X, so that f x v(D x ) d/i(x) = 1. 

Exercise 6.10. Let (h(k)) be a subsequence such that || — f\\ j is 

convergent. Then the Fubini-Tonelli theorem gives 






It follows that J2 k II fh(k) (-L 0 — f(x, OIIl^v) i s finite for /x-a.e. x e 
X, and for any such x the functions //,(*)(x, •) converge to / in L l (v). 
Choosing Y = {y} and v = to provide a counterexample it is suffi¬ 
cient to consider any example (see Remark 3.7) of a sequence converging 
in L 1 but not /x-almost everywhere. 

Exercise 6.11. It suffices to apply (6.15) to \h\ to show that f \h\ df /x is 
finite if and only if f \h\f d/i is finite. 

Exercise 6.12. We prove the property for the sup, the property for the inf 
being analogous. If A = B\ U B 2 with B\ e & and B 2 e & disjoint, we 
have 



J B\ J fii 



The arbitrariness of this decomposition, proves that [(//x) V (g/x)](A) < 
(f vg)ii(A). The converse inequality can be obtained noticing that, in the 
chain of equalities-inequality above, the inequality becomes an equality 
if we choose B\ = A (T {/ > g} and B 2 = A H {f > g}. 

Exercise 6.13. It is easy to check that /x < /x,- (respectively, /x > /x, ) for 
all is/, and that any measure v with this property is less than /x (resp. 


173 Introduction to Measure Theory and Integration 


greater than /z): just write v(B) = J2k v( ~ B k) < E/, l^i(k)( B k) (resp. 
— Ex l l nk)(Bk)- So, it remains to show that fi and JI are cr-additive. 
For any map i : N —> /, A\, A 2 e & disjoint and any countable im¬ 
measurable partition of A\ U A 2 we have 

00 00 00 

£/*'«(**) = n A]) + E Bi(k) (Bk n A2). 

Jr=0 X=0 k=0 

Estimating the right hand side from below with /z(Ai) + /z(A 2 ) we get 
(because (BO is arbitrary) that /z is superadditive, i.e. /z(Ai U A 2 ) > 
/z(Ai) + h(A 2 ). With a similar argument one can prove not only that JI 
is subadditive, but also that JI is a -subadditive (it suffices to consider a 
countable ^"-measurable family, instead of 2 sets). 

Now, let us prove that /z is subadditive and JI is superadditive. Let 
A], A 2 e -JA be disjoint and let B\, Bj: be countable ^"-measurable 
partitions of A\ and A 2 respectively. If i\, i 2 : N —>■ I we define i(2k) = 
i\(k), B 2 k = B l k and i(2n + 1) = i 2 (n), B 2 k+\ = B k , so that 

OO OO OO 

/z(Ai U A 2 ) < ^ i-ij(k)(B k ) = ^ Bi 2 (k)(B k ). 

k= 0 t=o x=o 

By the arbitrariness of B l k , B k , i\ and i 2 we conclude that /z(Ai U A 2 ) < 
/z(Ai) + /z(A 2 ). With a similar argument one can prove that /i is even 
a-subadditive (one has to use a bijection between N x N and N) and that 
JI is superadditive. 

Exercise 6.14. If for all s > 0 there exists S > 0 satisfying 

A e JP, 11 (A) < 8 => v(A) < s 

then v fi\ indeed, if /z(A) = 0 the implication above holds for all 
e > 0, hence v(A) = 0. If v is finite, to prove the converse we argue by 
contradiction. Assume that, for some Sq, we can find sets A n e with 
/z(A„) < 2~ n and v(A„) > e 0 . Then, by the Borel-Cantelli lemma the 
set A := lim sup ;) A n is /z-negligiblc. On the other hand, we have 

OO 

v(|J A '») - - £ o 

m=n 

and therefore (here we use the assumption that v is finite) v(A) > e (l , 
contradicting the absolute continuity of v with respect to /z. 

Exercise 6.15. Let be a /z-negligible set where v is concentrated. 

Then v(E) — v(E (T B) for all E e AF. But, by the absolute continuity 
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of v with respect to /x, we have v(E D B) = 0 because E D B C B is 
/r-negligible. 

Exercise 6.16. Let fie# be a v-negligible set where a is concentrated. 
Then 

a{E) = a(E n B) < /x(£ (T B) + v(£ Hfi)< /x(£) V£ e J* - , 

where we used the fact that v(E D B) = 0 because E n B c 5 is v- 
negligible. 

Exercise 6.17. It is easy to check that the class of functions / satisfying 
//x < v is a lattice. Hence, given a maximizing sequence (//,) in (6.20), 
possibly replacing //, by max,<„ f, , we can assume that //, f /. The 
monotone convergence theorem gives that / is a maximizer. 

In order to show that v = f /jl we set a = v — f /i > 0 and notice that 
a satisfies the following property: 

t > 0, fie #, tl B fi < a => /x(5) = 0. (A.17) 

Indeed, the integrals f x (f + fl B ) dn and f x f d/i have to coincide, be¬ 
cause (/ + tXB)B- < v. 

Exercise 6.18. We have to prove that any measure cr satisfying (A.17) 
is concentrated on a /x-negligible set. To this aim, let us consider the 
problem 


inf{/x(A) : Ae #, a is concentrated on A}. 


By taking the intersection of a minimizing sequence it is easy to check 
that also this problem has a solution A; we have to show that /x(A) = 0. 
By the minimality of A, the implication 




holds. Let us consider the numbers 


f/, := sup{/x(B) : #3ficA, xbH > 2%ct 


and let us prove that £/, —0 as /z —oo. Given maximizers /!/, c A, 
whose existence is easy to check, we have /x( B h ) > 2 h a(B h ) and in 
particular ^ /; o ( B h ) < oo. Hence 


a = 0 
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and (A. 18) tells us that necessarily 


0 = /x I limsup#/, I > limsup/x(5/,). 

\ h — >oo / h— >oo 

Let us show now that the maximality of B h implies that /x(C) < 2 h a(C) 
for any set C c A \ B h , i.e. ti A \ Bh /i < a. Indeed, if there is Co C A \ B h 
with /u (Co) > 2 / '<t(Co), the maximality of B h provides a minimal integer 
h\ > 1 and Ci C Co satisfying /x(Ci) < 2 /? cr(Ci) — \/h\. Let us 
consider Co \ C\\ we still have n(Co \ C\ ) > 2 /, cr(Co \ Ci) and the 
maximality of B h provide a minimal integer h 2 > h\ and C 2 C Co \ 
Ci satisfying /i(C 2 ) < 2 h o(C 2 ) — \/h 2 . Continuing in this way we 
have a nondecreasing sequence (/?,•) of integers and (C, ) C JF such that 
/x(C ( ) < 2 /, <r(C;) — 1 / /;, and Ci C Co \ C ; - for all i > 2; moreover 

/?,■ is the least integer for which there is such C, . Now lim, hi = oo, since 
the Ci are pairwise disjoint. Setting C = Co \ Uj°C,-, for all F e & 
contained in C, since F c Co \ Uj _1 C 7 - for all / > 2, we have /x(F) > 
2''a(F) — l/(hi — 1) (if hi > 2) and then n(F) > 2 h o{F). Hence B h UC 
is an admissible set for the maximum problem defining £/,, against the 
maximality of B h . 

We choose h in such a way that £/, < /x(A) and set t = 2~ h , B = A\B^ 
in (A.17). From (A.17) we conclude that /x(5) = 0, contradicting the fact 
that /x(5) = /x(A) — £/, > 0. 

Exercise 6.19. Let v = v + — v~ and let v + = v+ + u+, v~ = v~ + v~ be 
the Lebesgue decompositions with respect to n of v + and v~ respectively. 
Then, v a := \F — v~ and := — v“ provide a decomposition v = 

v a + v s with v a , v s signed, \v a \ «; /x and |v,| _L /x. 

If /x is signed and A provides a Hahn decomposition of /x (i.e. /x + (F) = 
/x(FHA) and/x“(F) = —/x(FnA c )), we repeat the decomposition above 
in A, relative to v and /x + , and in B = A c , relative to v and /x“. Denoting 
by iC + vj 4 and + vf the two decompositions obtained, 

v a (E) := v^ECA) + v B a {ECB), v s (E) := vf(E nA) + vf(Fn5) 


provides the desired decomposition v = v a + u v with |iy | <<C |/x| and 
\v s \ -L |/x|. 

The uniqueness of these decompositions can be proved with the same 
argument used in the case of nonnegative measures. 

Exercise 6.20. Let and let ( B,) be a jF-measurable partition of 

B \ since 


i =0 




\f\dfi= f \f\dn, 
J B 
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we obtain that \f/i\(B) < | f\ii(B). To prove the converse inequality fix 
s > 0 and define B, = B r 1 / _1 (/,), where /, = e[i, i + 1), i £ Z. Since 
the oscillation of \f — si | and ||/| — e|/|| in /“'(/,) are less than e, we 
get 



—einiBi) 


< en(Bj), 



d /r—e|/|/x(5,) 


< Sfi(Bi), 


hence 


It follows that 


L 


I/I dfi 


f dii\ 


< 2 six(Bi). 


£|/m(*/)| 

ieZ 


I] / >J 2 f 

ieZ ieZ 

[ \f\dfi-2e/i(B). 

J B 


|/| d/r — 2 sii(Bi) 


Since £ is arbitrary the converse inequality follows. 

Exercise 6.21. If x < 0 or x > 1 all repartition functions are respectively 
equal to 0 or 1, so we need to consider only the case v e [0, 1). The 
repartition function of %o, 1 obviously is equal to x, while 


/x/,((-o o,x]) 


#{i £ [1, h] : i < hx} 
h 


[hx] 

~h~ 


where [.?] denotes the integer part of s. Using the inequalities 5 — 1 < 
[5] < s with s = hx we obtain that /x/,((—oo, x]) —> x. 

Exercise 6.22. The argument is similar to the one used in the proof of 
Theorem 6.27: if y < x < y' and y, y' e D we have 


F(y) = lim F h (y) < lim inf Fj,{x) < lim sup F/,(x) 

h —>oo h—> oo h > oo 

< lim F h (y') = F(y'). 

h—>oo 


Letting y f x and y' x, we conclude. 

Exercise 6.23. We define a _ h 2 = /i((—oc. —h\) and, for — h 2 < i < h 2 , 
cij = fi((i — 1 )//z, i/h]). Let us denote by /x/ ; the measure obtained in 
this way. If x e (—h, h] and i is the smallest integer in (—/r, h 2 \ such 
that x < i/h, we have 


li 


( 


( 1 

V h 



i -1 

Y. at < /x/,((-oo,x]). 

j=—h 2 
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If x is not an atom of /x, this proves that 

liminf/x/jfX—oo, x]) > /x((—oo,x]). 
h 


Analogously 


l 1 



+ 




a-, > /X/,((-oo, x]). 

j=—h 2 


If x is not an atom of /z, this proves that 


limsup/z/,((—oo, x]) < /z((—oo,x]). 

h 


Exercise 6.24. Let us assume that (6.31) holds. If F,(x) —> 1 as x —> 
+oo uniformly in i e /, for any e > 0 we can find x such that 1 — F, (x) < 
e/2 for all i e /. Analogously, we can find y < x such that F,(y) < e/2 
for all i e I. Then, the interval I = (y, x] satisfies /x, (/) > 1 — e for all 
i G /, because I c — (—oo, y] U (x, +oo). 

Exercise 6.25. If /x is the weak limit and e > 0 is given, let us choose 
an integer n > 1 such that /z([l — n, n — 1]) > 1 — s and points 
x e (—n, 1 — n) and y e (n — 1, n) where the repartition functions of /x/, 
are converging to the repartition function of /x. Then, since /x((oo, x]) + 
1 — /x((—oo, y]) = /x(M \ (x, y)) < e, there exists n E e N such that 
sup (i>;)E /x„((oo, x]) + 1 — /x„((—oo, y]) < e. Let now x' and y' be satis¬ 
fying 

/x„((oo, x']) + 1 — /z„((—oo, y']) < e V« = 0, ..., — 1. 

Then, the interval I = [min{x, x'}, max{y, y'}] satisfies inf„ /x ( ,(/) > 

1 — e. 

Exercise 6.26. 

(a) lim, ; f R g dji h = f R g d/z Vg e C fo (M) (that is, (6.32)); 

(b) lim,, f R g dfi h = f R g d/i Vg e C c (M); 

(c) F h converge to F on all points where F is continuous; 

(d) F h converge to F on a dense subset of R; 

(e) lim,, /z,,(R) = /x(R); 

(f) (n h ) is tight. 

We consider the functions Ph(x) := p(x+h), where p(x) = (2tt) 2 

is the Gaussian, and /x^ = p h X (A being the Lebesgue measure), /x = 0. 
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In this case (c), (d), do not hold, because F h (x) —> 1 ^ 0 = F(x) for all 
x e R, (e) does not hold and (b) holds. 

a =>■ b, e. This is easy, because C c (R) C C*(R) and 1 R e C/,(R). 
a =>■ c. This follows by second part of the proof of Theorem 6.28. 
d <S> c. This is Exercise 6.22. 

bAe => c. This follows by the same argument used in the proof of second 
part of Theorem 6.28: the sequence (g*) monotonically convergent to l A 
can be chosen in C c (R), and this shows that liminf* /X/, (A) > n(A) for 
all A c R open. Using (e) and passing to the complementary sets, we 
obtain lim sup* /z*(C) < /i(C) for all C C R closed. 
d =>■ /. This follows by the same argument used in the solution of 
Exercise 6.25. 

d A f =} e. For all x € D, with D dense, we have lim/, /x*((—oo, x]) = 
/x((—oo,x]). Since /z*((—oo, x]) —>■ /z*(R) as x —► +oo uniformly in 
h, we can pass to the limit as x e D +oo to obtain lim* /i*(R) = 
lim A ^ +00 /x((-oo, x]) = /z(R). 

d A f =$■ a. This follows by the same argument used in the first part of 
the proof of Theorem 6.28, choosing the points tj in the partitions to be 
in the dense set where convergence occurs. 

Exercise 6.27. Set 

g($) := 1 f e^ x e~ x2/(2a2) dx. 

y/2jta 2 Jr 

Notice that g(0) = 1, and that differentiation theorems under the integral 
sign l2) and an integration by parts give 


g'G) 


y/ln 


= f ■>' 

7 TO 2 Jr 


* x (xe~ x /{2 °A )dx 


a 2 f . d 


L‘< 


\l2iza 2 Jr dx 

Ho 2 


S* e -FiaF-) dx 


\/2no 2 


J* 


Sx e -x 2 /(2a 2 ) dx _ 


Therefore g satisfies the linear differential equation g'($) = —cr 2 £g(£), 
whose general solution is g($) — ce~ a ~^! 2 . Taking into account that 
g(0) = l,c = l. 


® In this case, the application of the theorem is justified by the fact that sup^ 6 / \2Le'$ x e - v_ /(2a' _ )| 
is Lebesgue integrable for all bounded intervals I 
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Exercise 6.28. Let us approximate /x by /x„ = using the in¬ 

equality 

\ e i% x _ e ir l x \ < | x ||£ _ rj \ x, 

we obtain that 

IA«(£) - A«(f)I < l£ - V\ f \x\d/j, n (x) <n\% -r)\, 

Jr 

therefore /x„ is uniformly continuous. Since |/x«(£) ~ /x(£)l < /x(M \ 
[— n, n |), we have that /x„ —> jl uniformly as n —> oo, therefore /x is 
uniformly continuous (indeed, given e > 0, find n such that sup |/x„ — 
/x| < s/2 and 8 = s/(2n) to obtain |/x„(£) — jX n (r])\ < e/2 whenever 
|§ - p| < 5, and then |/x(£) - A0?)l < e). 

Exercise 6.29. Obviously |/x(£o)l = L and we set c = /x(§o) = e' e for 
some 0 e M. Since 

f |1 — ce ,x ^°\ 2 d/x(x) = 2 — cc — cc = 0, 

Jr 

we obtain that e lx ^° — c for /x-a.e. x e M. This implies that x/ 0 — 9 e 
2 jtZ for /x-a.e. x € M, so that /x is concentrated on the set of points 
{(2/7 7T + 9) /i;o}, !G f>j, and it suffices to set xo = 9 /% 0 to obtain the stated 
representation of /x as a sum of Dirac masses. 

Obviously | A| = 1 if/x is a Dirac mass. Conversely, if |/x| = l,wefindxo 
with /x({vo}) > 0 and £o, §o g M \ {0} with £o/£o ^ Q t0 obtain that /x is 
concentrated on the set {2«7r/§o+*o}neN an d on the set {2/i7r/£g+xo}neN- 
By our choice of §o and //, the intersection of the two sets is the singleton 
{jcq}, and this proves that fi = 8 Xo . 


Chapter 7 


Exercise 7.1. Let C > 0 be such that \H(x) — H(y) | < C\x — y| for all 
x, y e M. Lete > 0 and let <5 > 0 be such that Yi |/(h,) — /(a,)| < e/C 
whenever Yi( b > ~ a i) < s - We have Yi I H{f{b;)) - H(f(a t ))\ < 
C Yi I f(bi) ~ f( a i )I whenever Yi(^i — ad < <5. In particular, choosing 
/(f) = f, we see that Lipschitz functions are absolutely continuous. 

Exercise 7.2. We assume that both «£? 1 (F) > 0 and 1 (R \ E) >0. Let 
a e M be such that oo) 0 E) > 0 and 2zf '((a, oo) \ F) > 0, and 

define F(t) = (En(a, t )). By our choice of a, F{t) and ( t—a) — F(t) 
are not identically 0 in (a, +oo). 

If t > a is a rarefaction point of F, we have 


F(t + fc)-F(o ^((M + zonF) 

F, (f) = lim-= lim-= 0. 

^ hi o h hi o /x 


/x 


/x 




180 Luigi Ambrosio, Giuseppe Da Prato and Andrea Mennucci 


Analogously, F'_ (t ) = 0 and we find that F' is equal to 0 at all rarefaction 
points. A similar argument proves that F' = 1 at all density points. Let 
now to e (a, oo) where 0 < F(to) < (Jo — a) and apply the mean value 
theorem to obtain t' Q e (a, t 0 ) such that 

F(to) = (to-a)F'(t'o). 

By our choice of t 0 it follows that F'(t' 0 ) e (0, 1), a contradiction (be¬ 
cause either t' 0 is a density point or a rarefaction point). 

Exercise 7.3. Assume first that <p is continuous and bounded. Let H(z) ■— 
f z f(a) (p(y ) cly. By the (classical) fundamental theorem of the integral cal¬ 
culus, H is differentiable and H'(z) = <p(z) for all z e /(/). By the 
chain rule and Exercise 7.1, the function 

r f (0 

F(t):= <p(y)dy = H(f(t)) 

Jf(a ) 

is absolutely continuous and it has derivative equal to ) = 

<) at all points t where / is differentiable. On the other hand, 
still by the fundamental theorem of the integral calculus, the function 

G(t) := f cp(f(x))f(x)dx 
Ja 

has derivative equal to (yp o f)f F£ ! -a.e. in [a, b\. Since both F and G 
vanish at t = a, they coincide. 

By the dominated convergence theorem, the identity of the two func¬ 
tions persists if (p = t A , with A open (because t /A is the pointwise limit of 
continuous functions). By applying Dynkin’s theorem to the class ./# of 
the sets E e ^ (/(/)) such that 1 E (y)dy = f a l E (f(x))f\x) dx 
we obtain that the formula holds for all tp = t E with E Borel. Eventu¬ 
ally we obtain it for simple functions and, by uniform approximation, for 
bounded Borel functions. 

Exercise 7.4. Choosing g = 1 /V , by Exercise 7.3 we get {N) f dx = 
0, because 1^0/ = Let h + and h~ be respectively the positive 

and negative part of f ly-i (Ar) . Since 

nb pb pb 

/ h + dx— h~ dx — / dx = 0 

J a J a J a 

for all intervals (a, b), it follows that h + = h~ .if-a.e. in M. As a 
consequence, f' = 0 in 
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Chapter 8 

Exercise 8.1. Both are measures in (Z, J/F). If B e ZZthen gofyn(B) = 
/r(/ _1 (g _1 (B))), because (g o /) _1 = f~ l o g _1 . On the other hand, 

g#{f#H)(B) = )) = n{f-\g~\B))). 


Exercise 8.2. Let n > 1 integer, 0 < A < 2" and let us consider the 
interval I = [k/ 2", (A' + l)/2”). Then, / _1 (7) is the cylindrical set of all 
binary sequences a<)U\ ■ ■ ■ such that ao ■ ■ ■ a n -\ is the binary expression 
of k. It follows that 

x (i*. +)(/-(')) 

because their common value is 2~ n . On the other hand, f~ l ({1}) consists 
of a single point and therefore the identity above holds for I = {1}, the 
common value being 0. By additivity the identity holds for finite unions 
of sets of this type, a family stable under finite intersections. By the 
coincidence criterion the two measures coincide. 

Exercise 8.3. Let A c K be a dense open set whose complement C has 
strictly positive Lebesgue measure (Exercise 1.9), and let 

cp(t) := min {1, dist(t, C)} leR, 


By construction the function <p is continuous, nonnegative, bounded by 
1, and vanishes precisely on C. Then, set 


Fit) := 


/' 


Jf 


(pis) ds if t > 0; 
(p(s) ds if t < 0. 


We have F’ = q>, so that F e C l and its critical set C F = C has positive 
Lebesgue measure. It follows that F#J£ l is not absolutely continuous 
with respect to . Finally, since (pdt > 0 whenever a < b (because 
A fl (a, i) / 0) we obtain that F is strictly increasing. 

Exercise 8.4. Recall that F{Cp) is always Lebesgue negligible, regard¬ 
less of any injectivity assumption on U. Hence, possibly replacing U by 
U\Cp we can assume with no loss of generality that Cp = 0, i.e. DF(x) 
is nonsingular at any x e U. Recall that, according to the local invert- 
ibility theorem, for any x £ U there exists a ball B, (x) contained in U 
such that the restriction to F is injective. Now, following the strategy of 
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Lemma 6.9 we can cover U by a sequence of right open cubes { Q; } ;e/ , 
pairwise disjoint, such that the restriction of F to a neighbourhood of 
Qi is injective (we keep dividing a cube until this property is achieved). 
Let Qi = /\ i=l [cii, a, + <5); for h, < <7, sufficiently close to a, and 
Q, = A /=1 (bj , b, + <5) we have (by injectivity of F on Qi) 


F # (t dl S? n ) = 


1 


11 - FF n 


Qi \J f \oF~' f ™ 

and therefore we can pass to the limit to get 


F#{l Qi J? n ) = 


1 




I/fIoF- 1 ^) F(Qi) 

If we add both sides with respect to i € I we get 

= E , = E 7777 


tr |/F|of->(y) 


«F-hr) 1 <*> 
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